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; Search time 1703.81 Seconds 
(without alignments) 
7460.491 Million cell updates/sec 



OM nucleic - nucleic search, using sw model 
Run on: December 13, 2003, 04:37:24 

Title: US-09-852-261-5 
Perfect score: 523 

Sequence: 1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523 

Scoring table: I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 45562784 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 
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em_estba : * 

em__esthum: * 

em_estin: * 

em_estmu: * 

em_estov: * 

em_estpl : * 

em_estro : * 

em_htc : * 

gb_estl : * 
gb_est2:* 
gb_htc:* 
gb_est3 : * 
gb_est4:* 
gb_est5 : * 
em_estfun : * 
em__estom: * 
em_gss_hum: * 
em_gss__inv: * 
em_gss_pln : * 
em_gss_vrt : * 
em_gss_f un : * 
em_gss_mam: * 
em_gss_mus : * 
em__gss_pro : * 
em_gss_rod: * 
em_gss_phg : * 
em_gss_yrl : * 



28: gb_gssl:* 
29: gb_gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 

AI503976/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



AI503976 558 bp mRNA linear EST ll-MAR-1999 

vm43d08.xl Stratagene mouse diaphragm (#937303) Mus musculus cDNA 
clone IMAGE: 1001007 3' similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE) ; , mRNA sequence. 
AI503976 

AI503976.1 GI:4401827 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 558) 

Marra,M., Hillier,L. # Kucaba,T. # Martin, J. , Beck,C, Wylie,T. , 

Underwood, K. , Steptoe,M., Theising,B., Allen,M., Bowers, Y w Person 

,B., Swaller,T., Gibbons, M., Pape,D., Harvey, N., Schurk,R., Ritter 

,E., Kohn,S., Shin,T., Jackson, Y., Cardenas,M., McCann,R., 

Waterston,R. and Wilson, R. 

The WashU-NCI Mouse EST Project 1999 

Unpublished 

Contact: Marra M/WashU-NCI Mouse EST Project 1999 
Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI: 565223 

This clone was previously sequenced on the 5' end only, this new 

data is from the 3 ' end 

High quality sequence stop: 440. 

Location/Qualif iers 

1. .558 

/organism="Mus musculus" 
/mol_type= "mRNA" 
/db_xref ="taxon: 10090" 
/clones "IMAGE: 1001007" 
/tissue_type= "diaphragm" 
/dev_stage="adult " 

/lab_host="SOLR (kanamycin resistant) " 
/clone_lib="Stratagene mouse diaphragm (#937303)" 
/note=" Organ: diaphragm; Vector: pBluescript SK- ; Site_l: 
EcoRI; Site_2: Xhol ; Cloned unidirect ionally from mRNA 
prepared from diaphragm muscle. Primer: Oligo dT. Average 
insert size: 1.5 kb. Uni-ZAP XR Vector; -5' adaptor 
sequence: 5' GAATTCGGCACGAG 3' -3' adaptor sequence: 5' 

103 a 133 c 149 g 173 t 



Query Match 69.8%; Score 3 64.8; DB 9; 

Best Local Similarity 83.2%; Pred. No. 8.8e-81; 
Matches 441; Conservative 0; Mismatches 82; 



Length 558; 

Indels 7; Gaps 2; 
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RESULT 2 

AW146128/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



AW146128 623 bp mRNA linear EST 10-OCT-2000 

um37el0.xl Sugano mouse embryo mewa Mus musculus cDNA clone 
IMAGE: 22474 98 3' similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE);, mRNA sequence. 
AW146128 

AW146128.1 GI:6167864 
EST . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 {bases 1 to 623) 

Marra,M., Hillier,L., Kucaba,T. , Martin, J., Beck,C, Wylie,T., 
Underwood,K. , Steptoe,M., Theising,B., Allen, M. , Bowers # Y. # Person 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



contact the 
information . 



BASE COUNT 
ORIGIN 



, B. , Swaller,T., Gibbons, M., Pape,D., Harvey ,N., Schurk,R., Ritter 

, E., Kohn,S., Shin,T., Jackson, Y., Cardenas, M. , McCann,R., 

Waterston,R. and Wilson, R. 

The WashU-NCI Mouse EST Project 1999 

Unpublished 

Contact: Marra M/WashU-NCI Mouse EST Project 1999 
Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty- free through LLNL 
IMAGE Consortium (info@image.llnl.gov) for further 
MGI: 1006958 

Seq primer: custom primer used 
High quality sequence stop: 4 99. 

Location/Qualifiers 

1. .623 

/ organ ism="Mus mus cuius" 
/mol_type="mRNA" 
/strain="C57BL" 
/db_xref="taxon: 10090" 
/clone=" IMAGE: 22474 98" 
/dev_stage=" embryo, 14 dpc" 
/lab_host="DH10B" 

/clone_lib="Sugano mouse embryo mewa" 

/nqte=" Vector : pME18S-FL3; Site_l: Dralll (CACTGTGTG) ; 
Site_2 : Dralll (CACCATGTG) ; 1st strand cDNA was primed 
with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT] ; 
double -stranded cDNA was ligated to a Drain adaptor 
[TGTTGGCCTACTGG] , digested and cloned into distinct Drain 
sites of the pME18S-FL3 vector (5» site CACTGTGTG, 3' site 
CACCATGTG) . Xhol should be used to isolate the cDNA 
insert. Size selection was performed to exclude fragments 
<1.5kb. Library constructed by Dr. Sumio Sugano 
{University of Tokyo Institute of Medical Science) . 
Custom primers for sequencing: 5' end primer 
CTTCTGCTCTAAAAGCTGCG and 3 ' end primer 
CGACCTGCAGCTCGAGCACA . " 
123 a 138 c 170 g 191 t 1 others 



Query Match 69.4%; Score 363; DB 9; Length 623; 

Best Local Similarity 81.9%; Pred. No. 2.5e-8 0; 

Matches 433; Conservative 0; Mismatches 90; Indels 6; Gaps 
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541 GGACCAGAGACCCTTTTCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 4 82 
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481 AGGGGCTTTTACTTCAACAAGCCCA.CAGGCTATGGCTCC^GCATTCGGAGGGCACCTCAG 422 
121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

1 1 1 1 ! 1 1 1 IMIMMIM IMMMIMIMI IMIMIIM Mill MM 

421 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAAATGTAC 362 
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RESULT 3 

AI169253/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



AI169253 549 bp mRNA linear EST 08-JAN-1999 

EST215088 Normalized rat kidney, Bento Soares Rattus sp. cDNA clone 
RKIBP33 3' end, mRNA sequence. 
AI169253 

AI169253.1 GI:4134375 
EST. 

Rattus sp. 
Rattus sp. 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 (bases 1 to 549) 

Lee , N . H . , Glodek,A. , Chandra , I . , Mason, T. M. , Quackenbush, J. , 
Kerlavage , A . R . and Adams ,M.D. 

Rat Genome Project: Generation of a Rat EST (REST) Catalog & Rat 

Gene Index 

Unpublished 

On Oct 6, 1998 this sequence version replaced gi: 3705561. 
Other__ESTs: TC50779 
Contact: Lee, NH 

The Institute for Genomic Research 

9712, Medical Center Drive, Rockville, MD 20850, USA 
Tel: (301) -838-3529 
Fax: (301) -838-0208 
Ema i 1 : nhl ee@t igr . org 
Seq primer: M13-21. 

Location/Qualifiers 
1. .549 

/organism= M Rattus sp." 



/ mo 1 _t yp e = " mRNA " 

/ db_xr e f = " t axon : 1 0 1 1 8 " 

/clone= n RKIBP33" 

/clone_lib="Normalized rat kidney, Bento Soares" 

/not e=" Organ: kidney; Vector: pT7T3Pac; Site_l : EcoRI ; 

Site_2: Not 1 11 

BASE COUNT 112 a 140 C 133 g 164 t 

ORIGIN 

Query Match 66.6%; Score 348.2; DB 9; Length 549; 

Best Local Similarity 81.8%; Pred. No. 1.3e-76; 

Matches 428; Conservative 0; Mismatches 88; Indels 7; Gaps 2; 
Qy 8 AGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCT 67 

1 1 1 1 II Mill III III II I II Ml II II III I '11,111 IMIIM 

Db 54 9 AGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAATTCGTGTGTGGACCAAGGGGCT 4 9 0 

Qy 68 TTTATTTCAACAAGCCCACAGGATACGGCTC(^GCAGTCGGAGGGCACCT(^GA(^GGC^ 127 

1 1 1 1 MM MM Mill M II M M Ml MM I III I Mill II Mill MM 

Db 4 89 TTTACTTCAACAAGCCCACAGGCTATC 43 0 

Qy 128 TCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGCAC 187 

I I II 1 1 1 1 1 M I 1 1 1 M II 1 1 1 1 II II 1 1 M 1 1 1 1 1 1 II II 1 1 1 1 1 1 II I M I M I 

Db 429 TTGTGGATGAGTGTAG CTTCCGGAGCTGTGATCTGAGGAGG CTGGAGATGTACTGTGCTC 370 

Qy 188 CCCTCAAGCCGGC^AGGCAGCCCGCTCCGTCCGTGCCC^GCGCCACACCGACATGCCC^ 247 

I II Mill Mill MM M Ml MM MMMMM 

Db 3 69 CGCTGAAGCCTAOW^GT(^GCTCGTTCCATCCGGGCCCAGCGCCA(^TTGACATGCC(^ 310 

Qy 248 AGACTCAGAAGTATCAGCCTCCATCT^ 307 

llllllllllll Mill I Ml II IIIIIIIM MM M MMMMM 

Db 309 AGACT CAGAAGT CC CAG CC C CTATCGACACACAAGAAAAGGAAG CTG CAAAGGAGAAGGA 250 

Qy 3 08 AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAG 367 

IMIIIIIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMMMMIMM Mill III 

Db 24 9 AAGGAAGTACACTTGAAGAAC^CAAGTAGAGGAAGTGCAGGAAACAAGACTTACAGAATG 190 

Qy 368 TAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCTGCA 427 

Mill II III I Mill I II I II I IMIIM II I Mill 1 1 

Db 18 9 TAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTGCTGCTTG 13 0 
Qy 42 8 CAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATCACATTTC 481 

I MUM MM III MM II III MM II MUM 

Db 12 9 AGCAACCTGCAAAACATCGGAACACCTC 70 

Qy 4 82 AAAGAT -GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

I MM 1 1 1 1 1 1 1 1 : MMMIIIM MMIMMIM 

Db 69 AGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 27 



RESULT 4 
AI265629/C 

LOCUS AI265629 558 bp mRNA linear EST 18-NOV-1998 

DEFINITION uj04b07.xl Sugano mouse liver mlia Mus musculus cDNA clone 
IMAGE: 1890901 3' similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE) ; , mRNA sequence. 
ACCESSION AI265629 



VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



AI265629.1 GI:3883787 
EST. 

Mus mus cuius (house mouse) 
Mus mus cuius 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 {bases 1 to 558) 

Marra,M., Hillier,L., Allen, M., Bowles, M., Dietrich, N . , Dubuque, T., 
Geisel,S., Kucaba,T. , Lacy,M., Le , M . , Martin, J . , Morris, M., 
Schellenberg, K. , Steptoe,M., Tan,F., Underwood, K. , Moore ,B., 
Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson, R. and 
Waterston,R. 

The WashU-HHMl Mouse EST Project 
Unpublished 

Contact: Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Project 

Washington University School of MedicineP 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email : mouseest@watson . wustl . edu 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium {infd@image.llnl.gov) for further information. 
MGI : 975225 

Seq primer: custom primer used 
High quality sequence stop: 4 95. 

Location/Qualifiers 

1. .558 

/organism="Mus mus cuius" 
/ mo 1 __t yp e = " mRNA " 
/strain="C57BL" 
/db_xref ="taxon: 10090" 
/clone=" IMAGE: 1890901" 
/sex=" female" 
/dev_stage= "adult " 
/lab_host="DH10B" 

/clone_lib="Sugano mouse liver mlia" 

/note= n 0rgan: liver; Vector: pME18S-FL3; Site_l: Drain 
(CACTGTGTG) ; Site_2 : Drain (CACCATGTG) ; 1st strand cDNA 
was primed with an oligo(dT) primer 

[ ATGTGG CCTTTTTTTTTTTTTTTTT ] ; double- stranded cDNA was 
ligated to a Drain adaptor [TGTTGGCCTACTGG] , digested 
and cloned into distinct Drain sites of the pME18S-FL3 
vector (5' site CACTGTGTG, 3' site CACCATGTG). Xhol should 
be used to isolate the cDNA insert. Size selection was 
performed to exclude fragments <1.5kb. Library 
constructed by Dr. Sumio Sugano (University of Tokyo 
Institute of Medical Science) . Custom primers for 
sequencing: 5' end primer CTTCTGCTCTAAAAGCTGCG and 3 f end 
primer CGACCTGCAGCTCGAGCACA . " 
106 a 135 c 156 g 161 t 



Query Match 66.3%; 
Best Local Similarity 82.0%; 
Matches 414; Conservative 



Score 347; DB 9; Length 558; 
Pred. No. 2.5e-76; 
0; Mismatches 85; Indels 6; Gaps 



1; 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill MMMMMIMMIMIMMIIMMMIMM 

5 06 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 447 
61 AGGGGCTTTTATTTC^CAAGCCCACAGGATACGGCTGCAGCAGTCGGAGGGCACCTCAG 120 

MINIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II MUM I 1 1 1 1 1 1 1 1 1 1 . I ! 1 1 

44 6 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 3 87 
121 ACAGGCAT CGTGGATGAGTGCTG CTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

I I SIMM IIIIIIMIII III Ml III 1 1 II IM M I II I MUM Mill 

386 ACAGGCATTGTGGATGAGTGTTG CTT CCGGAG CTGTGATCTGAGGAGACTGGAGATGTAC 327 
181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Mill II II Mill MM IIIIIIIIMI M 1 1 II I M 1 1 1 1 M ' III 

32 6 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 267 
241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

MUM Mill llllll I I I I III II I III Ml III MM II III 

266 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 207 
301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

1 1 M M M 1 1 M I II 1 1 1 M;i I II 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIIIMI III 

2 06 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 147 

3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

III MM II Ml I MM I II I II IMIIIMM MUM 

14 6 CAGAATGTAGGAGGAGCCT CC CACGGAG CAGAAAATGCCACATCACCG CAGGATCCTTTG 87 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 



Mill 



II III III 



II MM 



8 6 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 27 
475 ACATTTCAAAGATGGCATTTCCCCC 4 99 

Mill Mill II MM 

26 ACATTACAAAGATGGGCATTTCCCC 2 



RESULT 5 
CD373004 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 



-17-0 
■17-0-UI 



614 bp mRNA linear EST 29-MAY-2003 
Ul.rl UI-R-GR0 Rattus norvegicus cDNA clone 
, mRNA sequence . 



;31157094 



CD373004 
UI-R-GRO-csv-j 
UI-R-GRO-csv-j 
CD373004 
CD373004.1 GI 
EST. 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 
Eukaryota; Metazoa; 
Mammalia; Eutheria; 
Rattus. 

1 (bases 1 to 614) 

Bonaldo , M . F . , Lennon,G. and Soares,M.B. 

Normalization and subtraction: two approaches to facilitate gene 
discovery 

Genome Res. 6 (9), 791-806 (1996) 

97044477 

8889548 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Rodentia; Sciurognathi ; Muridae; Murinae; 



COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Contact : Soares , MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF, Iowa City, IA 52242, USA 

Tel: 319 335 8250 

Fax: 319 335 9565 

Email: bento-soares@uiowa.edu 

Tissue Procurement: James Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Distribution information can be found at 
http: //genome .uiowa . edu/distribut ion/rat .html 
Seq primer: M13 REVERSE. 

Location/Qualifiers 
1. .614 

/organism="Rattus norvegicus" 
/ mo 1 _ t yp e = " mRNA " 
/strain="Sprague-Dawley" 
/db_xref-"taxon: 10116" 
/clone="UI -R-GRO -csv- j -17-0 -UI " 
/tissue_type= "Whole embryo" 
/dev_stage=" embryo 13dpc" 

/lab_host="DH10B (Life Technologies) (Tl phage resistant)" 
/clone_lib="UI -R-GRO" 

/note=" Vector : pYX-Asc; Site_l : EcoR I; Site_2: Not I; 
UI -R-GRO is a cDNA library containing the following 
tissue (s) : rat whole embryo 13dpc. The library was 
constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6:791-806, 1996. Denatured RNA was size 
fractionated on a 1% agarose gel . First strand cDNA 
synthesis was primed with oligo-dT primer containing a Not 
I site. Double strand cDNA was size selected according to 
mRNA size fraction, ligated with EcoR I adaptor, digested 
with NotI and then cloned directionally into pYX-Asc 
vector. The library tag sequence located between the Not I 
site and the polyA tail is CATCTCTACT. This library was 
created for the University of Iowa Program for Rat Gene 
Discovery and Mapping (Val Sheffield, Bento Soares and Tom 
Ca savant) . " 

171 a 168 c 154 g 119 t 2 others 



Query Match 64.9%; Score 339.6; DB 14; Length. 614; 

Best Local Similarity 81.4%; Pred. No. 1.9e-74; 

Matches 393; Conservative 0; Mismatches 90; Indels 0; Gaps 



0; 



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill IIIIIIIIIMIII i M M !l I M 1 1 1 1 1 1 1 

116 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 175 



QY 
Db 

Qy 



61 AGGGGCTTTTATTTCAA(^GCC(^C^GGATACGGCTCCAGCAGTCGGAGGGC^CCTCAG 120 

1 1 1 1 1 E 1 1 M I 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 II IMIIMIM MMIMIMM Ml 

176 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACaVCAG 235 
121 ACAGG CATCGTGGATGAGTG CTGCTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

II Mill IIIIIMIIII I I I M I ! I i Ml I I I MM 



Db 



236 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 2 95 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Mill II II Mill Mill MM II III MM MM IIMM MM III 

2 96 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 3 55 
241 ATGCCC^^GACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

III III II MM Mill 1 1 Mill I III II IIMIIIII MM II III 

356 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACA(^C^GAAAAGGAAGCTGCAAAGG 415 

3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

M I M 1 1 1! II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 MMMIMMIMM III 

416 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 4 75 
361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

III MIMMI II III I Mill I M II I IMMM II IIMM 

476 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATTCCACGTCACCGCATGATCCTTTG 535 
421 CTCTGC^CAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 48 0 

II I Mill MM Ml IIMM I II III II I 

53 6 CTGCTTGAGC^CCTGCANAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 5 95 
481 CAA 483 
596 CCA 598 



RESULT 6 
AI119218 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



AI119218 816 bp mRNA linear EST 02-SEP-1998 

ue94h02.yl Sugano mouse embryo mewa Mus musculus cDNA clone 
IMAGE: 1498803 5 1 similar to gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE) ; , mRNA sequence. 
AI119218 

AI119218.1 GI :3519542^ 
EST . 

"Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 816) 

Marra,M., Hillier,L., Allen, M. , Bowles,M., Dietrich, N. , Dubuque, T. , 
Geisel,S., Kucaba,T., Lacy,M., Le , M . , Martin, J., Morris, M., 
Schellenberg, K. , Steptoe,M. # Tan,F., Underwood, K. , Moore, B. # 
Theising,B., Wylie,T., Lennon,G., Soares,B., Wilson, R. and 
Waterston, R . 

The WashU-HHMI Mouse EST Project 
Unpublished 

Contact: Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Project 

Washington University School of MedicineP 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



MGI: 936407 

Seq primer: custom primer used 
High quality sequence stop: 473. 

Location/Qualifiers 

1. .816 

/organism="Mus musculus" 
/mol_type= " mRNA " 
/strain« ,! C57BL fi 
/db_xref="taxon: 100 90" 
/clones "IMAGE: 14 988 03" 
/dev_stage=" embryo, 14 dpc" 
/lab_host="DH10B" 

/clone_lib="Sugano mouse embryo mewa" 

/note="Vector: pME18S-FL3; Site_l: Drain (CACTGTGTG) ; 
Site_2: Drain (CACCATGTG) ; 1st strand cDNA was primed 
with an oligo(dT) primer [ATGTGGCCTTTTTTTTTTTTTTTTT] ; 
double-stranded cDNA was ligated to a Drain adaptor 

[TGTTGGCCTACTGG] , digested and cloned into distinct Drain 
sites of the pME18S-FL3 vector (5* site CACTGTGTG, 3* site 
CACCATGTG) . Xhol should be used to isolate the cDNA 
insert. Size selection was performed to exclude fragments 
<1.5kb. Library constructed by Dr. Sumio Sugano 

(University of Tokyo Institute of Medical Science) . 
Custom primers for sequencing: 5' end primer 
CTTCTGCTCTAAAAGCTGCG and 3 ' end primer 
CGACCTGCAGCTCGAGCACA . " 
230 a 219 c 172 g 187 t 8 others 



Query Match 64.9%; Score 33 9.2; DB 9; Length 816; 

Best Local Similarity 81.2%; Pred. No. 2.5e-74; 

Matches 389; Conservative 0; Mismatches 90; Indels 0; 



Gaps 



0; 



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill M MMI M 1 1 II 1 1 M I II 1 1 1 1 II I II 1 1 1 1 1; II 1 1! 

323 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 3 82 



Qy 



Db 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

MMIMIMI I I I I I I I I I I I I I M IMIIIIIII Ml I I I II I I I II I I 

3 83 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 442 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

MM MM MIIMIMM III 1 1 1 1 M 1 1 1 1 II II I II I MMIMIMI 

443 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 502 
181 TGTGC^CCCCTC^GCCGGCAAAGGC^GCCCGCTCCGTCCGTGCCCAGCGCC^C^CCGA^ 24 0 

Mill M II Mill MM MMIMIMI II I M 1 1 1 II II II 1 1 M I III 

503 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 562 
241 ATGCCCAAGACTC^GAAGTATCAGCCTCC^TCTACCAACAAGAAAATGAAGTCTCAG 3 0 0 

MMMMMMIMIMI I I I I III M MMMIMI MM II III 

563 ATG C CCAAGACTCAGAAGT CCCCGTC CCTATCGACAAACAAGAAAACGAAGCTG CAAAGG 622 
3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 I Ml II Mill MM III 

623 AGAAGGAAAGGAAGTACATTTGAAGAAC^C^GTAGAGGAAGTGCANGAAAC^AGACCTA 682 



Qy 
Db 

Qy 
Db 



361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

Ml Mill 1 1 II III I MM I I I M IIIIIII II llllll 

683 CAGAATGTANGAGGAG CCTN CCACGGAGCAGAANATGCCACATCACCG CANGATC CTTTG 742 
421 CTCTGCACAGTTACCTGTAAACATTGGAATAC 479 

II I Mill I II III Mill II I I I I Ml 

743 CTGCTTGAGCAACCTGCANAACATCGAAACACCTACCAAATAACATNTATAAGTCCAAT 801 



RESULT 7 
BF383724 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BF383724 594 bp mRNA linear EST 27-NOV-2000 

602044632F1 NCI_CGAP_Li9 Mus musculus cDNA clone IMAGE: 4194295 5 1 , 
mRNA sequence. 
BF383724 

BF383724.1 GI:11365029 
EST. 

Mus musculus {house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 594) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes- of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Jeffrey E. Green, M.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. ' 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : / / image . 1 Inl . gov 
Plate: LLAM9527 row : p column: 08 
High quality sequence stop: 589. 
Location/Qualifiers 
1. .594 

/organism="Mus musculus" 
/mo l_t ype = " mRNA " 
/strain="FVB/N" 
/db_xref ="taxon: 10090" 
/clones "IMAGE: 4194295" 

/lab__host = "DH10B (Tl phage-resistant ) " 
/ clone_lib="NCI_CGAP_Li9" 

/note="Organ: liver; Vector: pCMV-SPORT6; Site_l: NotI; 
Site_2 : Sail; Cloned unidirectionally . Primer: Oligo dT. 
Average insert size 1.9 kb. Constructed by Life 
Technologies. Note: this is a NCI_CGAP Library." 
175 a 162 c 142 g 115 t 



Query Match 64.0%; 
Best Local Similarity 82.0%; 
Matches 400; Conservative 



Score 334.8; DB 10; 
Pred. No. 2.9e-73; 
0; Mismatches 82; 



Length 594; 
Indels 6; 



Gaps 



1; 



Qy 



16 TGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCTTTTATTTC 75 



Mill MIMIIIIIIMIIIIIIIIMIIMIIIIIIIM lllllllllll III 

Db 107 TGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCGAGGGGCTTTTACTTC 166 

Qy 76 AACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAGACAGGCATCGTGGAT 135 

MM I MINIMI II I Mill 1 1 II i : MIMI 

Db 167 AACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAGACAGGCATTGTGGAT 226 

Qy 136 GAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGCACCCCTCAAG 195 

Mill MINIM III Mill 1 1 MM II II MNNNNNIMN II II III 

Db 227 GAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTACTGTGCCCCACTGAAG 28 6 

Qy 196 CCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGACATGCCCAAGACTCAG 255 

II Nil lllllllllll II I INN I Ml IN INI NIMH III Ml II I II 

Db 287 CCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGACATGCCCAAGACTCAG 34 6 

Qy 2 56 AAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGGAGAAGGAAAGGAAGT 315 

INI M I I III II I Illllllll MM N II INN NNN II III 

Db 347 AAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGGTGCAAAGGAGAAGGAAAGGAAGT 4 06 

Qy 316 ACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTAGGAAGA 375 

INNNNNNNNNINN III II I INN II I II WWW NNN II II 

Db 4 07 ACATTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTACAGAATGTAGGAGGA 4 66 

Qy 3 76 CCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGCTCTGCACAGTTACC 435 

IN I INI MINI NIIIINN ININII I 

Db 4 67 GCCTCCCACGGAGCAGAAAATGCGACATCACCGCAGGATCCTTTGCTGCTTGAGCAACCT 526 
Qy 436 TGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATCACATTTCAAAGATGG 48 9 

WWW I II III III II lllllll II INN Illllllll 

Db 527 GCAAAACATCGAAACACCTACCAAATAACAATAATA 58 6 

Qy 4 90 CATTTCCC 4 97 

II II 

Db 587 GCATTGCC 594 



RESULT 8 
CB959991 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



CB959991 796 bp mRNA linear EST 29-APR-2003 

AGENCOURT_1388 8 044 NIH_MGC__147 Homo sapiens cDNA clone 
IMAGE:30341081 5', mRNA sequence. 
CB959991 

CB959991. 1 GI :30216107 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 796) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. Stefan Hansson 

cDNA Library Preparation: Michael J. Brownstein (NHGRI ) 
and advice from Piero Carninci (RIKEN) 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



(MGC) 



with help 



FEATURES 

source 



BASE COUNT 
ORIGIN 



cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http : / / image . llnl . gov 
Plate: NDAM371 row: p column: 18 
High quality sequence stop: 707. 
Location/Qualif iers 
1. .796 

/organism="Homo sapiens" 
/mol__type="mRNA" 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 3 0341081" 
/tissue_type="Human Placenta" 
/lab_host="DH10B TonA" 
/clone_lib="NIH_MGC_147" 

/note="Organ: placenta; Vector: pBluescriptR; Site_l : 
all-XhoI; Site_2: BamH; Oligo-dT primed using primer 
5 ' - TTTTTTTTTTTTTTTTVN - 3 ' , size-selected for average 
insert size 2.3 kb and normalized to ROT 5. This is a 
primary library enriched for full-length clones and 
constructed using the Cap-trapper method (Carninci, in 
preparation) . Library constructed by M. Brownstein 
(NIMH/NHGRI, National Institutes of Health). Note: This is 
a NIHJVIGC library. " 
224 a 197 c 191 g 184 t 



Query Match 63.9%; Score 334.4; DB 14; Length 796; 

Best Local Similarity 84.6%; Pred. No. 3.9e-73; 

Matches 44 5; Conservative 0; Mismatches 26; Indels 55; Gaps 



4; 



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

MINIM MINI MINI 1 1 1 II I II II 1 1 1 1 1 1 1 II II I II I II II 1 1 1 1 1 II I M 

180 GGAC CGGAGACG CTCTGCGGGGCTGAGCTGGTGGATG CT CTTCAGTT CGTGTGTGGAGAC 23 9 



Qy 

Db 

Qy 

Db 

Qy 

Db 



61 AGGGGCTTTTATTTC7^CAAGCC(^CAGGATACGGCTC(^GC^GTCGGAGGGCACCTCAG 12 0 

IMIIMMIIIIIIIIIIIIIMIMII M III IMMIIIIIII MINI 

24 0 AGGGGCTTTTATTTCAACAAGCC(^(^GGGTATGGCTC(^GCAGTCGGAGGGCGCCTC^G 299 
121 ACAGG CAT CGTGGATGAGTG CTG CTT CCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

1 1 1 1 1 1 1 1 1 1 1 1 M I M I i 1 1 i 1 1 1 M M 1 1 M 1 1 1 1 1 1 1) 1 1 1 1 1 II I M 1 1 1 1 1 1 1 

3 00 ACAGG CAT CGTGGATGAGTG CTG CTTC CGGAG CTGTGATCTAAGGAGGCTGGAGATGTAT 359 
181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

M IMMIMMIMI II III MM Mill hi I Ml 1 1 1 MM I II 1 1 1 II 

3 60 TGCGCACCCCTt^GCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 419 



Qy 

Db 

Qy 

Db 



241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAA 300 

MIIIIIMM Ml 

420 ATGCCCAAGACCCAG 434 

3 01 AGAAGGAAAGGAAGTAC^TTTGAAGAACAC^GTAGAGGGAGTGC^GGAAAa^GAACTA 360 

1 1 ]! 1 1 1 1 [ 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 [ 1 1 1 r 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 

435 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 48 7 



Qy 



3 61. CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 



MIMUI llllll Mi MIIIIMIIIIII MM llllllllllll MINI 

Db 488 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 547 

Qy 421 CTCTGCAC - AGTTACCTG -TAAACA/TTGGAATACCGGCC^^ 478 

III MINIM! Mill llllll III MM Ml I I I 1 1 1 1 

Db 54 8 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 607 

Qy 479 TTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II llllll III MIMMIMIMIMMMMMMMMM 

Db 608 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 653 



RESULT 9 

AW495481/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus , 



AW495481 499 bp mRNA linear EST 24-FEB-2000 

UI-M-BH3-auy-g-ll-0-UI.sl NIH_BMAP_M_S4 Mus musculus cDNA clone 
UI-M-BH3-auy-g-ll-0-UI 3 ! , mRNA. sequence. 
AW495481 

AW4 95481. 1 GI : 7 065762 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 499) 

Bonaldo, M . F . , Lennon,G. and Soares, M.B. 

Normalization and subtraction: two approaches to facilitate gene 
discovery 

Genome Res. 6 (9), 791-806 (1996) 

97044477 

8889548 

Contact: Chin, H 

National Institute of Mental Health 

6001 Executive Blvd. Room 7N-7190, MSC 9643, Bethesda, MD 

20892-9643, USA 

Tel: 301 443 1706 

Fax: 301 443 9890 

Ema i 1 : mEST@ma i 1 . nih . gov 

The sequence contained an oligo-dT track that was present in the 
oligonucleotide that was used to prime the synthesis of first 
strand cDNA and therefore this may represent a bonaf ide poly A 
tail. The sequence tag present in the cDNA between the NotI site 
and the oligo-dT track served to identify it as a clone from the 
normalized pineal glands library cDNA Library Preparation: M.B. 
Soares Lab Clone distribution: Researchers may obtain BMAP cDNA 
clones from RESEARCH GENETICS. It should be noted that Bento Soares 
is generating a small number of additional specialized 
non-redundant arrays of BMAP cDNAs whose availability will be 
considered under appropriate and limited collaborative arrangements 
Seq primer: M13 Forward 
POLYA=Yes . 

Location/Qualif iers 

1. .499. 

/organism="Mus musculus" 
/mol_type= ,f mRNA lf 
/strain=" C57BL/6J" 
/db_xref ="taxon: 10090" 



/clone="UI-M-BH3-auy-g-ll-0-UI" 
/dev_stage="27-32 days" 
/lab_host="DH10B (Life Technologies)" 
/clone_lib="NIH_BMAP_M_S4" 

/note= "Vector: pT7T3D-Pac (Pharmacia) with a modified 
polylinker; Site_l: Not I; Site_2 : Eco RI ; The 
NIH_BMAP_M_S4 library is a subtracted library of a series, 
ultimately derived from a mixture of individually tagged 
normalized libraries from ten regions of the mouse brain 
(cerebellum, brain stems, olfactory bulbs, hypothalamus, 
cortex, amygdala, basal ganglia, pineal gland, striatum, 
hipoccampus) after a series of subtractions to reduce the 
representation of cDNAs from which ESTs had already been 
generated. The following serially subtracted libraries 
were generated in this process: NIH_BMAP_M_S4 , 
NIH_BMAP_M_S3.3, NIH_BMAP_M_S3 . 2 , NIH_BMAP_M_S3 . 1 , 
NIH_BMAP_M_S2, NIH_BMAP_M_S1 . The subtracted library 
(NIH_BMAP_M_JS4) was constructed as follows: PCRamplified 
cDNA inserts f rom NIH_BMAP_M_S3 . 3 , NIH_BMAP_M_S3 . 2 , and 
NIH_BMAP_M_S3 . 1 clones from which 3' ESTs had been derived 
was used as a driver in a hybridization with a pool of 
the NIH_BMAP_M_S3 .3, NIH_BMAP_M_S3 . 2 , and NIH_BMAP_M_S3 . 1 
libraries in the form of single-stranded circles. The 
remaining single-stranded circles (subtracted library) 
was purified by hydroxyapatite column chromatography, 
converted to double- stranded circles and electroporated 
into DH10B bacteria (Lif eTechnologies) to generate the 
NIH_BMAP_M_S4 library. This procedure has been previously 
described ( Bona 1 do, Lennon and Soares, Genome Research 
6:791-806, 1996) 
TAG_L I B = N I H_BMAP_M__S 4 
TAG_TISSUE=pineal -glands 
TAG_SEQ=CAGAC" 

BASE COUNT 86 a 112 c 124 g 177 t 

ORIGIN 

Query Match 61.6%; Score 322.2; DB 9; Length 499; 

Best Local Similarity 82.3%; Pred. No. 4e-70; 



Matches 


396; Conservative 0; Mismatches 78; Indels 7; Gaps 


2; 


Qy 


50 


TGTGTGGAGACAGGGGCTTTTATTT(^CAAGCC(^(^GGATACGGCTCaVGCAGTCGGA 


109 


Db 


499 


MINIM Illllllllll MIMMIMMIMM II IIIIIIIMI Mill 

TGTGTGGACCGAGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGA 


440 


Qy 


110 


GGGCACCTCAGACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGC 


169 


Db 


439 


MMIMMIMMIMM MIMIMIM MMMIIIIMIIMIMMIMM 1 

GGGCACCTCAGACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGAC 


380 


Qy 


170 


TGGAGATGTACTGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGC 

Mill MIMIMIM II II Mill II II 1 1 1 II II 1 M 1 II 1 II Ml II II 

TGGAGATGTACTGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGC 


229 


Db 


37 9 


320 


Qy 


230 


GCC&CACCX^CATGCC(^<^CT^ 


289 


Db 


319 


II Mill III II III III MINI Mill 1 1 1 1 III II MM MUM II 

GCCACACTGACATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAAC 


260 



Qy 



2 90 AGTCTCAGAGGAGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGA 349 



II II IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMI MINIMI 

Db 25 9 AGCTGCAAAGGAGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGA 2 00 

Qy 3 50 AACAAGAACTACAGGATGTAGGAAGACC CTT CTGAGGAGTGAAGAAGGACAGGC CAC CG C 4 09 

Mill MM Mill II III I MM I M I II MM 

Db 199 AAC^GACCTAC^GAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGC 14 0 
Qy 410 AGGACCCTTTGCTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAA 4 63 

MM MINI II I MUM I II III III II II 

Db 13 9 AGGATCCTTTGCTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAA 8 0 

Qy 464 TAAGTTTGATCACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATT 522 

Mill II Mill IMIMI I MINI 1 1 1 1 1 1 1 1 1 i I M I III! MMM 

Db 79 TAAGTCCAATAACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAAGTAAACATT 2 0 

Qy 523 C 523 

I 

Db 19 C 19 



RESULT 10 

AI876493/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



AI876493 642 bp mRNA linear EST 21-JUL-1999 

uj59bl0.xl Sugano mouse liver mlia Mus musculus cDNA clone 
IMAGE: 1924219 3' similar to gb : X57 025_rnal INSULIN-LIKE GROWTH 
FACTOR IA PRECURSOR (HUMAN); gb:X04482 Mouse mRNA for 
preproinsulin-like growth factor IB (MOUSE) ; , mRNA sequence. 
AI876493 

AI876493.1 GI:5550542 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 642) 

Marra,M., Hillier,L., Kucaba,T., Martin, J., Beck,C, Wylie,T., 

Underwood, K. , Steptoe,M., Theising,B., Allen, M. , Bowers, Y. , Person 

,B., Swaller,T., Gibbons, M. , Pape,D., Harvey, N., Schurk,R., Ritter 

,E., Kohn,S., Shin,T., Jackson, Y., Cardenas,M., McCann,R., 

Waterston,R. and Wilson, R. 

The WashU-NCI Mouse EST Project 1999 

Unpublished 

Contact: Marra M/WashU-NCI Mouse EST Project 1999 
Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest@watson.wustl.edu 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI : 980511 

Seq primer: custom primer used 
High quality sequence stop: 257. 

Location/Qualif iers 

1. .642 

/organism="Mus musculus" 
/mol_type= n mRNA " 



BASE COUNT 
ORIGIN 



/strain="C57BL" 
/db_xref= n taxon: 10090" 
/clone= fI IMAGE: 1924219" 
/sex= " female" 
/dev_s t age = "adult " 
/lab_host="DH10B" 

/clone_lib="Sugano mouse liver mlia" 

/note="Organ: liver; Vector: pME18S-FL3 ; Site_l: Drain 
(CACTGTGTG); Site_2 : Drain (CACCATGTG) ; 1st strand cDNA 
was primed with an oligo(dT) primer 

[ATGTGGCCTTTTTTTTTTTTTTTTT] ; double- stranded cDNA was 
ligated to a Drain adaptor [TGTTGGCCTACTGG] , digested 
and cloned into distinct Drain sites of the pME18S-FL3 
vector (5' site CACTGTGTG, 3' site CACCATGTG). Xhol should 
be used to isolate the cDNA insert. Size selection was 
performed to exclude fragments <1.5kb. Library 
constructed by Dr. Sumio Sugano (University of Tokyo 
Institute of Medical Science) . Custom primers for 
sequencing: 5' end primer CTTCTGCTCTAAAAGCTGCG and 3' end 
primer CGACCTGCAGCTCGAGCACA . " 
127 a 154 c 175 g 185 t 1 others 



Query Match 61.3%; Score 320.8; DB 9; Length 642; 

Best Local Similarity 80.1%; Pred. No. 9.6e-70; 

Matches 403; Conservative 0; Mismatches 93; Indels 7; 



Gaps 



Qy 



Db 



2 GACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACA 61 

I I I I Mill II Mill llllllllllllllllllllllll I I 
503 GACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGGTCGTGTGTGGACCGA 4 44 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



62 GGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAGA 121 

MINIMI 1 1 Mill II III MM M I III I Mill I MINIMI I I 

443 GGGGCTTTTTCTTCAACAAGGCCACAGGCTATGGCTCCAGCATTTGGAGGGCACCTCAGA 384 
122 CAGGCATCGTGGATGAGTGCTGCTTCCGG-AGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Mill II II I II Ml I MM II III I : : i 1 1 1 1 1 1 1 1 1 1 M III III I MM 

383 CAGTCAATGTGGATGAGTGTTGCTTC CGGAAG CTGTGATCTGAGAAGACTGNAGATGTAC 324 
181 TGTGt^CCCCTCAAGCCGGC^^GGCAGCCCGCTC^^ 24 0 

Mill II II Mill II 1 1 I II II II 1 1 1 1 I II 1 1 11 1 1 1 1 1 1 1 1 M 1 1 Ml 

323 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 2 64 
241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAA 300 

IMIIIIMIIIIIIIIII I I I I Ml II IIIIIMI MM II Ml 

263 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAG 2 04 

301 AGAAGGAAAGGAAGTACATTTGAAGAA(^(^GTAGAGGGAGTGCAGGAAACAAGAACTA 360 

MM II I I M MUM II I II M I II II I M || M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 

2 03 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 144 
361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

Ml IIIIIMI II Ml I MM I II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

143 CAGAATGTAGGAGGAG CCTCCCACGGAG CAGAAAATGCCACATCACCG CAGGATC CTTTG 84 



Qy 



421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 



Db 



83 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATA 24 



Qy 4 75 ACATTTCAAAGATGGCATTTCCC 4 97 

Mill lllllllll II II 

Db 23 ACATTACAAAGATGGGCATTTCC 1 



RESULT 11 
BM984670/C 

LOCUS BM984670 673 bp mRNA linear EST 20-FEB-2003 

DEFINITION UI-CF-ECl-abj -k-24-0-UI . si UI-CF-E'Cl Homo sapiens cDNA clone 

UI-CF-ECl-abj-k-24-0-UI 3', mRNA sequence. 
ACCESSION BM984670 

VERSION BM984670.1 GI: 19610417 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 {bases 1 to 673) 

AUTHORS Bonaldo,M.F. , Lennon,G. and Soares,M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 
discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8889548 
COMMENT Contact: McCray, PB 

McCray Lab 
University of Iowa 

2024 University of Iowa Med Labs, Iowa City, IA 52242, USA 

Tel: 319 356 4866 

Fax: 319 356 7171 

Email: paul-mccray@uiowa.edu 

Tissue Procurement: Dr. M. J. Welsh, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Researchers may obtain clones from Research 

Genetics (www.resgen.com) or from Open Biosys terns 

(www.openbiosystems.com) . 

Seq primer: M13 FORWARD 

P0LYA=Yes . 

FEATURES Location/Qualifiers 
source 1. .673 

/organism="Homo sapiens" 
/ mol_type= " mRNA " 
/db_xref = 11 1 axon: 9606" 
/clone="UI-CF-ECl-abj -k-24-0-UI" 
/tissue_type="Lung" 
/dev_stage= "Adult and Fetal" 

/lab__host="DH10B (Life Technologies) (Tl phage resistant) " 
/clone_lib="UI-CF-ECl" 

/note="0rgan: Lung,- Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site__l : EcoR I; Site_2 : Not I; 
UI-CF-EC1 is a normalized cDNA library containing the 
following tissue (s): Normal lung from adult and from fetal 
day 64, day 87, week 19 and week 42. The library was 



constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6:791-806, 1996. First strand cDNA 
synthesis was primed with an oligo-dT primer containing a 
Not I site. Double stranded cDNA was ligated to an EcoR I 
adaptor, digested with Not I, and cloned directionally 
into pT7T3-Pac vector. The oligonucleotide used to prime 
the synthesis of first-strand cDNA contains a library tag 
sequence that is located between the Not I site and the 
(dT) 18 tail. The sequence tag for this library is 
AAGTG CTTAC . 
TAG_LIB=UI-CF-EC1 

TAG^TI SSUE= Normal Lung Epithelial Cells Tissue nos 369-371 

and 380-383 

TAG_S EQ = AAGTGCTTAC " 
BASE COUNT 152 a 164 c 169 g 188 t 

ORIGIN 

Query Match 61.3%; Score 320.8; DB 12; Length 673; 

Best Local Similarity 84.2%; Pred. No. 9.7e-70; 

Matches 443; Conservative 0; Mismatches 27; Indels 56; Gaps 5; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 II 1 1 1 MM 1 1 !i I II 

Db 492 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 433 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATAC^ 12 0 

1 1 II I MINIMI! MM Mill II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II MM! 

Db 432 AGGGG - TTTTATTTCAGCAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 3 74 

Qy 121 ACAGG CATCGTGGATGAGTGCTG CTTC CGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 373 ACAGG CATCGTGGATGAGTGCTG CTTC CGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 314 

Qy 181 TGTGC^CCCCTCAAGCCGGC^AGGCAGCCCGCT^ 24 0 

II INN Illllllll II Ml 1 1 1 1 Mill II II I III II I II 1 1 Ml I II II I 

Db 313 TGCGC^CCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 254 

Qy 241 ATGCC CAAGACTCAGAAGTATCAG CCT CCATCTAC CTCAGAGG 3 00 

lllllllllll III 

Db 253 ATGCCCAAGACCCAG 23 9 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAAO^CAAGTAGAGGGAGTGCAGGAAACAAGAACTA 3 60 

1 1 1 1 1 1 M 1 1 1 1 1 M MUM MIMIIIIIII MIMIII 

Db 23 8 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 186 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

1 1 1 1 II 1 1 1 III 1 1 1 I I i 1 1 1 1 1 J 1 1 1 1 1 1 1 MINIMUM MINI 

Db 18 5 CAGGATGTAGGAAGACC CT CCTGAGGAGTGAAGAGTGACATGCCACCG CAGGATCCTTTG 126 

Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 4 78 

I I I I I II I Illllllll I II I I I I II I I III M I II I II I I I I I I I I I I I I I I 

Db 12 5 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 66 

Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II MINI III M 1 1 III M 1 1 MM I Ml 1 1 M I M 1 1 1 

Db 65 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 2 0 



RESULT 12 

AI248089/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



AI248089 575 bp mRNA linear EST 01-DEC-1998 

qh69f05.xl Soares_f etal_liver_spleen_lNFLS_Sl Homo sapiens cDNA 
clone IMAGE: 1849953 3' similar to gb : X57025_rnal INSULIN -LIKE 
GROWTH FACTOR IA PRECURSOR (HUMAN);, mRNA sequence . 
AI248089 

AI 248 08 9. 1 GI : 3 84348 6 
EST. 

Homo s ap i ens ( human ) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates ; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 575) 

NCI -CGAP http : //www . ncbi ,nlm . nih . gov/ ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 

Tumor Gene Index 

Unpublished 

Contact: Robert Strausberg, Ph.D. 
Ema i 1 : cgapbs - r@ma i 1 . nih . gov 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
Insert Length: 918 Std Error: 0.00 
Seq primer: -40UP from Gibco 
High quality sequence stop: 380. 

Location/Qualifiers 

1. .575 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref= n taxon: 9606" 
/clone=" IMAGE: 1849953" 
" /sex="male" 
/dev_stage="2 0 week-post conception fetus" 
/lab_host="DH10B (ampicillin resistant)" 
/clone_l ib= " Soares_f etal_l iver_spleen_lNFLS_Sl " 
/note-" Organ: Liver and Spleen; Vector: pT7T3D (Pharmacia) 
with a modified polylinker; Site_l: Pac I; Site_2 : Eco RI ; 
This is a subtracted version of the original Soares fetal 
liver spleen 1NFLS library. 1st strand cDNA was primed 
with a Pac I - oligo(dT) primer [5* 
AACTGGAAGAATTAATTAAAGATCTTTTTTTTTTTTTTTTTTT 3 ' ] , 
double- stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Pac I and cloned into the Pac I 
and Eco RI sites of the modified pT7T3 vector. Library 
went through one round of normalization. Library 
constructed by Bento Soares and M.Fatima Bonaldo." 
135 a 152 c 131 g 156 t 1 others 



Query Match 60.6%; 
Best Local Similarity 83.8%; 
Matches 428; Conservative 



Score 316.8; DB 9; 
Pred. No. 9.4e-69; 
0; Mismatches 28; 



Length 575; 
Indels 55; 



Gaps 



4; 



Qy 



Db 



16 TGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGACAGGGGCTTTTATTTC 75 

Mill llllllllllll IIIIIIIIIIIIIIIIIMII IIIIIIIIIIIIIMIIII 

551 TGCGGGGCTGAGCTGGTGNATGCTCTTCAGTTCGTGTGTGAAGACAGGGGCTTTTATTTC 4 92 



Qy 76 AACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGC^ 135 

t II I It I I I t I I I I II I I II I I I I I I I I I II I lllllllllllllllllllll 
Db 4 91 AAO^GCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAGACAGGaVTCGTGGAT 432 

Qy 136 GAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTACTGTGCACCCCTCAAG 195 

;i 1 1 in i i ii 1 1 1 ii m 1 1 1 1 1 1 ii 1 1 1 1 1 1 ii ii iiiiiiiiiiM 

Db 431 GAGTG CTGCTT C CGGAG CTGTGAT CTAAGGAGGCTGGAGATGTATTG CG CACCCCTCAAG 372 

Qy 196 CCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCC^C^CCGACA.TGCCCAAGACTCAG 255 

II II III 1 1 1 1 Mill MINIUM lllillllllMIIIIMIIMII I III 

Db 371 CCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACAGCGACATGCCCAAGACCCAG 312 

Qy 256 AAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGGAGAAGGAAAGGAAGT 315 

II III 

Db 311 AAGGAAGT 304 

Qy 316 ACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTACAGGATGTAGGAAGA 375 

MIIIMMIMI IMIIMIIMMMMIMMIMMIMIMMIMMIMMI 

Db 3 03 ACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTACAGGATGTAGGAAGA 244 

Qy 376 CCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTGGTCTGCAC-AGTTAC 434 

MM MM I Mill III I MM III II I Mill I Mill MINI III MUM 

Db 243 CCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTGCTCTGCACGAGTTAC 184 

Qy 435 CTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTTCAAAGAT-GGCAT 4 92 

III Mill MIMI Ml 1 1 II 1 1 1 1 1 1 1 1 II I M I I II 1 1 1 II I M I III I 

Db 183 CTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACATTTAAAAGATGGGGGT 124 

Qy 4 93 TTCCCCCAATGAAATACACAAGTAAACATTC 523 

1 1 1 ii I Mi 1 1 : II i II 1 1 1 1 1 1 

Db 123 TTCCCCCAATGAAATACACAAGTAAACATTC 93 



RESULT 13 

AI169770/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
COMMENT 



AI169770 468 bp 

EST215669 Normalized rat liver, 
RLIAT07 3' end, mRNA sequence. 
AI169770 

AI169770.1 GI:3709810 
EST. 

Rattus sp. 
Rattus sp. 
Eukaryota ; Metazoa ; 
Mammalia; Eutheria; 
Rattus . 

1 (bases 1 to 468) 
Lee, N . H . , Glodek,A. 



mRNA linear 
Bento Soares Rattus 



EST 20-JAN-1999 
sp. cDNA clone 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Rodent ia; Sc iurognathi ; Muridae; Murinae; 



Mason, T . M . 



EST 



Chandra , I 
Kerlavage,A.R. and Adams , M . D . 
Rat Genome Project: Generation of a Rat 
Gene Index 
Unpublished 
Other_ESTs: TC50779 
Contact: Lee, NH 

The Institute for Genomic Research 

9712, Medical Center Drive, Rockville, MD 20850, USA 



Quackenb.ush, J . , 
(REST) Catalog & Rat 



Tel: (301) -838-3529 
Fax: (301) -838-0208 
Email: nhlee@tigr.org 
Seq primer: M13-21. 
FEATURES Location/Qualifiers 
source 1. .4 68 

/ organ ism="Rattus sp . " 
/mol_type="mRNA" 

/db_xref="ATCC (inhost) : 2027570" 
/db_xref="taxon: 10118" 
/clone="RLIAT07" 

/clone_lib=" Normalized rat liver, Bento Soares" 
/note-"0rgan: liver; Vector: pT7T3Pac; Site_l: EcoRI ; 
Site_2: Notl" 

BASE COUNT 85 a 115 c 119 g 149 t 

ORIGIN 

Query Match ' 5 9.1%; Score 3 09.2; DB 9; Length 4 68; 

Best Local Similarity 81.8%; Pred. No. 7.2e-67; 

Matches 383; Conservative 0; Mismatches 78; Indels 7; Gaps 2; 
Qy 63 GGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCt^GCAGTCGGAGGGC^CCTCAGAC 122 

1 1 1 1 1 1 1 1 1 IIMIIIIIIIIIIIII I! MINIM s 1 1 1 1 1 1 1 : 1 ! Mill 

Db 4 68 GGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGC 4 09 

Qy 123 AGG CATCGTGGATGAGTG CTGCTT C CGGAGCTGTGAT CTGAGGAGG CTGGAGATGTACTG 182 

Mill MIMMIMI MM IMMIIMM MIMIII III MIMIII 

Db 4 08 GGGCATTGTGGATGAGTGTTGCTCCCGGAGCTGTGATCTGAGGAGGTTGGAGATGTACTG 34 9 

Qy 183 TGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGT^^ 242 

Ml II II Mill Mill MM II III MM MIMMIMI III Mill 

Db 34 8 TGCTCCGCTGAAGCCTAOyyVGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGACAT 2 8 9 

Qy 243 G CC CAAGACT CAGAAGTATCAG C CTCCATCTACCAACAAGAAAATGAAGTCTCAGAGGAG 3 02 

Mill MIMIIIl! Mill I Ml M IIIIIMM MM II Mill 

Db 2 88 GCC CAAGACT CAGAAGTC C CAG CC C CTATCGACACACAAGAAAAGGAAG CTG CAAAGGAG 22 9 

Qy 3 03 AAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGA^ 362 

MMIMMMIMM I III I II II I III II I III I II Ml I II Ml Ml II MM I 

Db 228 AAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTC 169 

Qy 3 63 GGATGTAGGAAGACC CTTCTGAGGAGTGAAGAAGGACAGG C CACCG CAGGACCCTTTG CT 422 

I MIMIII M III I Mill I M I M I Ml MM II IIIIMM 

Db 168 GAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGT CACCG CAAGATCCTTTGCT 109 
Qy 423 CTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATCAC 476 

I Mill I MM III MM IIIIIMM II I 

Db 108 GCTTGAGCAACCTGCAAAAQVTCGGAACACCT 4 9 



Qy 4 77 ATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

llllll MM MUM I MM II II III M 1 1 1 1 1 M 1 1 

Db 4 8 ATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 1 



RESULT 14 

AA542914/C 

LOCUS 



AA542914 



498 bp mRNA linear EST 19-AUG-1997 



Qy 60 CAGGGGCTTTTATTTC^C^GCCCACAGGATACGGCTCCAGC^GTCGGAGGGCACCTCA 119 

IIIIMI II I III 1 1 1 II 1 1 1 1 1 IN 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

Db 416 CAGGGGC-TTTATTTCAACAAGCCCACAGGGTATGGCTCCAG(^GTCGGAGGGCGCCT(^ 358 

Qy 12 0 GACAGGCATCGTGGATGAGTG CTGCTTC CGGAG CTGTGATCTGAGGAGGCTGGAGATGTA 179 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 Ml I! 1 1 1 Ml 1 1 1 III I 1 1 1 Ml i I ; 1 1 1 1 1 1 1 1 ! 1 1 1 1 ! 

Db 357 GACAGGCATCGTGGATGAGTG CTG CTTC CGGAG CTGTGATCTAAGGAGG CTGGAGATGTA 2 98 

Qy 180 CTGTGCACCCCTCAAGCCGGOU^GGC^GCCCGCTCCGTCCGTGCCCAGCGCG^CACCGA 23 9 

1 1 II Ml II II I MM II Ml MM Mill II II II I II I II II I M 1 1 1 II I 

Db 2 97 TTGCGCACCCCTC^GCCTGCO^GTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGA 238 

Qy 24 0 CATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCA 2 99 

MIMIMIMI III 

Db 237 CATGCCCAAGACCCAG- 222 

Qy 300 GAGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGC^GGAAACAAGAACT 359 

1 1 1 1 1 1 1 1 Ml Ml 1 1 Ml I III I II IM I II II M I II 1 1 1 1 1 1 M 

Db 221 AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACT 170 

Qy 360 ACAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG CCAC CG CAGGACCCTTT 419 

llllllll MIMIMIMI I MUM II I MM II II I M I II 1 1 II II II II I 

Db 169 ACAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATG CCAC CG CAGGATCCTTT 110 

Qy 42 0 GCTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACA 477 

III III III MM I MM Mill MM III 1 1 1 II II II II II I II II I II 

Db 109 GCTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACA 5 0 

Qy 478 TTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

III Mill I III M III II I III II II MM II II II I II I II 

Db 4 9 TTTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 3 



RESULT 15 

AI604642 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



AI604642 882 bp mRNA linear EST 21-APR-1999 

vm43d08.yl Stratagene mouse diaphragm (#9373 03) Mus musculus cDNA 
clone IMAGE: 1001007 5' similar to gb:M11568 INSULIN -LIKE GROWTH 
FACTOR IB PRECURSOR (HUMAN); gb:X04482 Mouse mRNA for 
preproinsul in-like growth factor IB (MOUSE);, mRNA sequence. 
AI604642 

AI604642.1 GI:4613809 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota ; Metazoa ; Chordata ; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 882) 

Marra,M., Hillier,L., Kucaba,T., Mart in, J., Beck,C, Wylie,T., 
Underwood, K. , Steptoe,M., Theising,B w Allen, M., Bowers, Y. , Person 
,B., Swaller,T., Gibbons, M. , Pape,D., Harvey,N., Schurk,R. 
,E., Kohn,S., Shin,T., Jackson, Y. , Cardenas, M. , McCann,R., 
Waterston,R. and Wilson, R. 
The WashU-NCI Mouse EST Project 1999 
Unpublished 

Contact: Marra M/WashU-NCI Mouse EST Project 1999 
Washington University School of Medicine 



Craniata; Vertebrata; Euteleostomi ; 
Sc iurognat hi ; Muridae; Murinae; Mus. 



Ritter 



DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BASE COUNT 
ORIGIN 



ni98cl0.sl NCI__CGAP_Pr21 Homo sapiens cDNA clone IMAGE : 984882 3' 
similar to gb : X57025_rnal INSULIN-LIKE GROWTH FACTOR IA PRECURSOR 
(HUMAN) ; , mRNA sequence. 
AA542914 

AA542914 . 1 GI :2291394 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 498) 

NCI -CGAP http : //www . ncbi . nlm . nih . gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 

Tumor Gene Index 

Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Michael J. Brownstein, M.D. # Ph.D., Michael R. 
Emmert -Buck , M . D . , Ph . D . 

cDNA Library Preparation: M. Bento Soares, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI -CGAP clone distribution information can be 
found through the I.M.A.G.E.- Consortium/LLNL at: 
www-bio . 1 lnl . gov/bbrp / image / image . html 
Insert Length: 603 Std Error: 0.00 
Seq primer: -40ml3 fwd. ET from Amersham 
High quality sequence stop: 412. 
Location/Qualif iers 
1. .498 

/organism= M Homo sapiens" 
/ mo 1 _ t yp e = " mRNA " 
/db_xref ="taxon: 96 06" 
/clone=" IMAGE: 984882" 
/sex="male" 

/ t is sue_type= "normal prostate" 

/lab_host="DH10B" 

/cloneJL ib= "NCI_CGAP_Pr2 1 " 

/note="0rgan: prostate; Vector: pT7T3D-Pac (Pharmacia) 
with a modified polylinker; 1st strand cDNA was prepared 
from normal prostate bulk tissue, and was then primed with 
a Not I - oligo(dT) primer. Double- stranded cDNA was 
ligated to Eco RI adaptors (Pharmacia) , digested with Not 
I and cloned into the Not I and Eco RI sites of the 
modified pT7T3 vector. Library is not normalized. Library 
was constructed by Bento Soares and M. Fatima Bonaldo. " 
105 a 135 c 123 g 135 t 



Query Match 58.3%; Score 305; DB 9; Length 498; 

Best Local Similarity 83.5%; Pred. No. 8.2e-66; 

Matches 440; Conservative 0; Mismatches 30; Indels 57; 



Gaps 



6; 



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGC - TGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGA 59 

MINIUM I MMI 1 1 MMMMMMMMMMMMMMMMMM 

4 76 GGACCGGAGAACTTTTGCGGGGCTTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGA 417 



4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email: mouseest @wa t son.wustl.edu 

This clone is available royalty- free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI : 565223 

This read is a RESEQUENCE of a previously sequenced mouse clone 
This read has been verified (found to hit its original self in the 
correct orientation) 
Seq primer: -4 0RP from Gibco 
High quality sequence stop: 361. 
FEATURES Location/Qualifiers 
source 1 . . 882 

/organism="Mus musculus" 

/mol__type="mRNA" 

/db_xref="taxon: 10090" 

/clone- "IMAGE: 1001007" 

/ 1 i s sue__type= "diaphragm" 

/dev_stage=" adult" 

/lab_host="SOLR (kanamycin resistant) " 
/clone_lib="Stratagene mouse diaphragm (#937303) " 
/note= "Organ : diaphragm; Vector: pBluescript SK- ; Site_l: 
EcoRI; Site_2: Xhol; Cloned unidirect ionally from mRNA 
prepared from diaphragm muscle. Primer: Oligo dT. Average 
insert size: 1.5 kb. Uni-ZAP XR Vector; -5' adaptor 
sequence: 5' GAATTCGG CACGAG 3' -3 1 adaptor sequence: 5' 
CTCGAGTTTTTTTTTTTTTTTTTT 3 T " 

BASE . COUNT 236 a 226 c 183 g 223 t 14 others 

ORIGIN 

Query Match 57.2%; Score 2 99; DB 9; Length 882; 

Best Local Similarity 77.2%; Pred. No. 3e-64; 

Matches 393; Conservative 0; Mismatches 109; Indels 7; Gaps 3; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill MINIM 1 1 1 1 1 1 ] 1 : 1 1 1 : 1 11 M M ! 1 1 1 1 

Db 377 GGACCAGAGACCCTTTGCGGNGCTGAGCTGNTGGATGCTCTTCAGTTCGTGTGTGGACCG 43 6 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

MIMIIMM llllll MIMIM II MIIMIIII I I MM I Ml III I 

Db 437 AGGGGCTTTTACTTCAAOyVGCCC^t^GGCTATGGCTCCAGCATTTCGAGGGCACCTCAG 4 96 

Qy 121 ACAGG CATCGTGGATGAGTG CTG CTT CCGGAG CTGTGAT CTGAGGAGGCTGGAGATGTAC 180 

MIMIM IMIIMMM MIMMMMMIMM I III llllll! 

Db 4 97 ACAGGCATTGTGGATGAGTGTTG CTT CCGGAG CTGTGAT CTGANGAGACTGGAGATGTAC 556 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGt^GCCCGCTCCGTCCGTGCCC^GCGCC^aVCCGAC 24 0 

Mill II II Mill M I MIMIIMM II 1 1 1 II 1 1 1 1 II II 1 1 1 1 III 

Db 557 TGTGCCCCACTGAAGCCTACANAAGCAGCCCGCTCTATCCGTGCCCA.GCGCCACACTGAC 616 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCT 3 00 

MM IMIIMMM II I III II I MIIMIIII I III 

Db 617 ATGCNNCAGACTCAGAAGTCCCCGTNCCTATCGACNNAACAGAAAACGAAGCTTGCAAGG 676 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 3 60 

III MM IMIIMMM Mill II MM llllll I Mill II I Ml 



Db 677 GAGAGGGAAGGGAGTACATTTGAGGAAC^CANGTNGAGGAAGTGCANGAAACAAGACCTA 736 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I 1 1 1 1 1 1 II II Mill I I I MM II MM 1 1 MIMI 

Db 737 CCAGATGTANGAGGAGCCTTC - - AACCGAGCAGAAATGCACATCACCCGNAGATCCTTTG 794 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAA AAAATAAGTTTGATCAC 476 

II II II III MIMI I I II MM II I I II I I 

Db 795 CT - G CTTGAGCTAACTG CAAACATCGAAC CACTTAC CAATTACAATAATAAGTT CAATAA 853 

Qy 477 ATTTCAAAGATGGCATTTCCCCCAATGAA 505 

I IIMIIIIMIIIIIIIMI I II I 

Db 854 CATACAAAGATGGCATTTCCCCCATTGAA 882 



Search completed: December 13, 2003, 07:29:51 
Job time : 1704.81 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring' table: 



Searched: 



December 13, 2003, 05:41:20 ; Search time 2336.77 Seconds 

(without alignments) 
9156.102 Million cell updates/sec 

US-09-852-261-5 
523 

1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523 

IDENTITY__NUC 

Gapop 10.0 , Gapext 1.0 

2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -process ing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5777422 



Database 



GenEmbl : * 



1 




gb_ba : * 




2 




. gb_htg : 


★ 


3 




gb in : * 




4 




gb om : * 




5 




gb_ov : * 




6 




gb_pat : 




7 




gb_ph : * 




8 




gb_pl : * 




9 




gb_pr : * 




10 


gb__ro : 


* 


11 


gb sts 


: * 


12 


gb_sy : 


* 


13 


gb un: 


* 


14 


gb__vi : 


* 


15 


em_ba : 


* 


16 


em fun 


: * 


17 


em_hum 


. * 


18 


em_in : 


★ 


19 


em mu: 




20 


em_om : 


* 


21 


em or: 


* 


22 


em_ov : 


* 


23 


em_pat 


: * 


24 


em ph: 


* 


25 


em_pl : 


★ 


26 


em ro : 


* 


27 


em__sts 


: * 



28 


em un : * 


29 


em vi : *' 


30 


em htg hum : * 


31 


em_htg_inv : * 


32 


em htg other:* 


33 


em__htg_mus : * 


34 


em_htg_pln: * 


35 


em htg_rod : * 


36 


em htg mam : * 


37 


em htg vrt : * 


38 


em_sy : * 


39 


em htgo hum : * 


40 


em htgo mus : * 


41 


em_htgo_other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

o, 
o 

Result Query 

No. Score Match Length DB ID Description 



1 


523 


100 


. 0 


523 


6 


AX147746 


AX147746 Sequence 


2 


523 


100 


. 0 


523 


6 


AX300783 


AX300783 Sequence 


3 


467 


.4 


89 


.4 


517 


6 


AX147742 


AX147742 Sequence 


4 


467 


.4 


89 


.4 


517 


6 


AX300779 


AX30077 9 Sequence 


5 


409 


78 


.2 


471 


6 


AX147754 


AX147754 Sequence 


6 


409 


78 


.2 


471 


6 


AX300791 


AX300791 Sequence 


7 


. 364 


.8 


69 


.8 


1536 


10 


BC012409 


BC012409 Mus muscu 


8 


361 


.6 


69 


. 1 


798 


10 


RNIGFI2 


X06108 Rat mRNA (c 


9 


361 


.6 


69 


. 1 


958 


10 


RNIGFI1 


X06107 Rat mRNA (c 


10 


358 


.4 


68 


.5 


710 


10 


RATIGFIA 


M15480 Rat insulin 


11 


356 


.8- 


68 


.2 


539 


6 


AX147744 


AX147744 Sequence 


12 


356 


. 8 


68 


.2 


539 


6 


AX300781 


AX300781 Sequence 


13 


349 


.4 


66 


.8 


651 


10 


MMIGFIBR 


X04482 Mouse mRNA 


14 


336 


64 


.2 


432 


4 


AF022961 


AF022961 Oryctolag 


15 


334 


4 


63 


9 


7260 


6 


AX375028 


AX375028 Sequence 


16 


334 


4 


63 


9 


7260 


6 


AX411095 


AX411095 Sequence 


17 


334 


4 


63 


9 


7260 


9 


HSIGFACI 


X57025 Human IGF-I 


18 


332 


8 


63 


6 


666 


6 


A29119 


A29119 H. sapiens I 


19 


332 


8 


63 


6 


725 


9 


HSIGFI 


X00173 Homo sapien 


20 


332 


8 


63 


6 


728 


9 


HUMGFII 


M2 9644 Human insul 


21 


332 


4 


63 


6 


616 


9 


HSIGF1A 


X56773 H. sapiens m ' 


22 


331 


2 


63 


3 


620 


6 


108370 


108370 Sequence 2 


23 


331 


2 


63 


3 


1076 


9 


HUMIGFI 


M2 7544 Human insul 


24 


316 


8 


60 


6 


444 


9 


HSU40870 


U40870 Human alter 


25 


293 


8 


56 


2 


532 


4 


SSILGF1M 


X174 92 Porcine mRN 


26 


293 


8 


56 


2 


567 


4 


PIGGFIIA * 


M31175 Pig insulin 


27 


286 


2 


54 


7 


888 


4 


ECU85272 


U852 72 Equus cabal 


28 


281 


8 


53 


9 


978 


4 


GOT IGF I 


D11378 Goat mRNA f 


29 


281 


8 


53 


9 


978 


6 


E05279 


E05279 DNA encodin 


30 


280 


2 


53 


6 


1284 


4 


BTILGF1A 


X15726 Bovine mRNA 


31 


275 


4 


5.2 


7 


747 


. 4 


SHPIGFIA6 


M31735 Sheep insul 


32 


275 


4 


52 


7 


790 


4 


SHPIGFIA2 1 


M31734 Sheep insul 


33 


275 


4 


52. 


7 


1015 


4 


SHPIGFIA46 


M31736 Sheep insul 



34 


275 . 


,4 


52, 


,1 


1107 


4 


SHPIGFIA1 


M3 0653 Sheep insul 


35 


275 . 


.2 


52. 


.6 


730 


9 


HSIGF1B 


X56774 H . sapiens m 


36 


275, 


.2 


52 . 


.6 


1094 


9 


HUMGFIB 


M11568 Human insul 


37 


275. 


.2 


52 . 


.6 


1136 


6 


E01349 


E0134 9 cDNA encodi 


38 


275. 


.2 


52 . 


.6 


1136 


6 


108009 


108009 Sequence 3 


39 


274 , 


.6 


52 . 


.5 


3599 


6 


BD063790 


BD063790 Insulin-1 


40 


274 , 


.6 


52. 


.5 


3 5 99 


6 


BD069041 


BD069041 Treatment 


41 


274 . 


.6 


52. 


. 5 


3600 


6 


BD063789 


BD063789 Insulin-1 


42 


274 . 


.6 


52. 


.5 


3600 


6 


BD069040 


BD069040 Treatment 


43 


273 


.6 


52 . 


.3 


612 


9 


HUMIGFIB 


M37484 Human insul 


44 


271 , 


.6 


51. 


.9 


696 


10 


MMIGFIAR 


X04480 Mouse mRNA 


45 


268 , 


.4 


51. 


.3 


513 


26 


RNMGJIL 


M17714 Rat insulin 



ALIGNMENTS 



RESULT 1 
AX147746 

LOCUS AX147746 523 bp DNA linear PAT 31-AUG-2001 

DEFINITION Sequence 5 from Patent WO0136483 . 
ACCESSION AX147746 

VERSION AX147746.1 GI: 14346791 

KEYWORDS 

SOURCE Oryctolagus cuniculus (rabbit) 

ORGANISM Oryctolagus cuniculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 
REFERENCE 1 

AUTHORS Goldspink,G.R. and Johnson, I . R . 

TITLE Use of the insulin-1 ike-growth factor i isoform mgf for the 

treatment of neurological disorders 
JOURNAL Patent: WO 0136483-A 5 25-MAY-2001; 
University College London (GB) 
FEATURES Location/Qualifiers 
source 1. .523 

/organism^" Oryctolagus cuniculus" 
/ mo l_type= "genomic DNA" 
/db__xref ="taxon: 9986" 
CDS <1. .336 

/note="unnamed protein product" 

/codon_start=l 

/prot ein_id- " CAC4 1 1 77 . 1 " 

/db_xref="GI : 14346792" 

/ db_xr e f = " REMTREMBL : CAC4 1177 " 

/ trans lation= "GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV 
DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRR 
KGSTFEEHK" 

BASE COUNT 154 a 129 c 142 g 98 t 

ORIGIN 

Query Match 100.0%; Score 523; DB 6; Length 523; 

Best Local Similarity 100.0%; Pred. No. 9.5e-150; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

I M 1 1 1 Ml il 1 1 1 III II I MMI 1 1 1 1 1! Ill 1 1 1 1 il 1 1 M III 1 1 Ml 1 1 1 II II 

61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

121 ACAGG CAT CGTGGATGAGTGCTG CTTCCGGAG CTGTGAT CTGAGGAGGCTGGAGATGTAC 180 

I I I I I I I I I II I I I I I I II I I I ! I I II I I I I I I I I I I I II I I I I I I I I I II I II I I I I I I 

121 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 180 

181 TGTGCACCCCTC^GCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

I M 1 1 1 Ml M II I III II III Ml II II M MM III M II MIMI I III I M MM 

181 TGTG(^CCCCTC^GCCGGCAAAGGC^GCCCGCTCCGTCCGTGCCCAGCGC(^C^CCGAC 24 0 
241 ATGCCCAAGACTt^GAAGTATCAGCCTCC^TCTACCAACAAGAAAATGA^ 300 

I MM I! II II I II M I M III I M 1 1 II 1 1 MM 1 1 1 1 M 1 1 1 III M M 1 1 1 1 M II 

241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTC^ 300 
3 01 AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I M M I II I II 1 1 1 M I II I II Ml 1 1 1 1 1 1 MM 1 1 1 1 M 1 1 1 Ml 1 1 II 1 1 M II II 

3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 
3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

I IM 1 1 M I M 1 1 1 M I II I II Ml 1 1 1 1 1 1 IMM 1 1 II 1 1 1 1 MM I M 1 1 1 1 M I 

3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

MM 1 1 M I M 1 1 1 M I III II Ml 1 1 1 1 1 1 IM 1 1 1 1 M 1 1 1 1 MM I M 1 1 II M II 

421 CTCTG^CAGTTACCTGTAAAC^TTGGAATACCGGCOWy^TAAGTTTGATCACATTT 480 
481 CAAAGATGG CATTT C CCC CAATGAAATACACAAGTAAACATTC 523 

I IM 1 1 Ml M 1 1 1 M I III II Ml 1 1 1 1 1 IMM 1 1 1 1 IM 

481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



RESULT 2 
AX300783 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



AX300783 523 bp DNA linear PAT 30-NOV-2001 

Sequence 5 from Patent WO0185781. 

AX300783 

AX300783.1 GI:17382064 

Oryctolagus cuniculus (rabbit) 
Oryctolagus cuniculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; ' Euteleostomi • 

Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

1 

Goldspink,G.D. and Terenghi , G . B . 

Repair of nerve damage 

Patent: WO 0185781-A 5 15-NOV-2001; 

University College London (GB) ; East Grinstead Medical Research 
Trust (GB) 

Locat ion/Qual i f iers 

1. .523 

/organi sm= " Oryctolagus cuniculus " 

/mol_type= M genomic DNA" 

/db xref="taxon: 9986" I 



CDS <1. .336 

/note= "unnamed protein product" 
/ codon__s t a r t = 1 
/protein_id="CAD13042 .1" 
/db_xref="GI : 17382065" 

/translation- "GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV 
DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRR 
KGSTFEEHK" 

BASE COUNT 154 a 12 9 c 142 g 98 t 

ORIGIN 

Query Match 100.0%; Score 523; DB 6; Length 523; 

Best Local Similarity 100.0%; Pred. No. 9.5e-150; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps . 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAG^ 12 0 

MIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMMIII 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCA 120 

Qy 121 ACAGG CATCGTGGATGAGTGCTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

II MINIM III Mill IIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIMI 

Db 121 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I II 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 

Db 181 TGTGCACCCCTOVAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II I II 1 1 II 1 1 M I M I II II 1 1 1 1 1 II II II 1 1 1 II 1 1 

Db 241 ATG CC CAAGACTCAGAAGTAT CAGCCTCCAT CTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGC^ 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACAOVAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Qy 361 CAGGATGTAGGAAGACC CTT CTGAGGAGTGAAGAAGGACAGG CCACCG CAGGACCCTTTG 420 

II 1 1 1 1 1 M II I II II II II II II II II II 1 1 II II 1 1 M 1 1 1 M 1 1 II II I II I II M I 

Db 3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG CCACCG CAGGACCCTTTG 420 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAA^ 480 

1 1 M 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

MMMMMIMMMIMMMIMIMMIMMMIMI 

Db 481 CAAAGATGGCATTTCCCCCAATGAAATACACIAAGTAAACATTC 523 



RESULT 3 
AX147742 

LOCUS AX147742 517 bp DNA linear PAT 31-AUG-2001 

DEFINITION Sequence 1 from Patent WO0136483 . 
ACCESSION AX147742 



VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



AX147742.1 GI: 14346787 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Goldspink,G.R. and Johnson, I. R. 

Use of the insulin- like-growth factor i isoform mgf for the 
treatment of neurological disorders 
Patent: WO 0136483-A 1 25-MAY-2001; 
University College London (GB) 

Location/ Qualifiers 

1. .517 

/organism="Homo sapiens" 
/mol_type= "genomic DNA" 
/db_xref ="taxon: 9606" 
<1. .333 

/not e=" unnamed protein product" 

/codon_start=l 

/protein_id="CAC41175 .1" 

/db_xref="GI : 14346788" 

/ db_xre f = " REMTREMBL : CAC4 1175" 

/ translat ion="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGI V 
DECCFRSCDLRRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKYQPPSTNKNTKSQRRK 
GSTFEEHK" 
150 a 130 c 139 g 98 t 



Query Match ' 89.4%; Score 467.4; DB 6; Length 517; 

Best Local Similarity 96.2%; Pred. No. 1.3e-132; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 



2; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


298 



I I I 1 1 I Ml III I M I IMMMM 



I I I I I I I II: I I I I II I I I III I I I I HIM 



MMI M I M I IMM I I I II! I I M Mi, I I I 



II 



Mil I II III I II III Mil Mill I III Mill III III II I Mill 



II II I III II I MM II I llllll I 1 1 1 1 1 MM 1 1 1 II I 



III 



-G 297 



I I II I I M II II II II II II II II II I I II I I II II I I II II I I II II II I I I I I I I I I 



Qy 
Db 

Qy 
Db 

Qy 
Db 



361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

MM Mi MIMIII MIMIIIMMII MMMMMMI MMMIMI 

358 CAGGATGTA - GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACGCTTTG 416 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT^ 48 0 

MM IMIMMMMM MMMMMMI MMMMMIN MMMIMI 

417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476 
481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521 

M M M M M I M I Ml I 1 1 1 1 MM M I II I II II 

477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 



RESULT 4 
AX300779 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



AX300779 517 bp DNA linear PAT 30-NOV-2001 

Sequence 1 from Patent WO0185781. 

AX300779 

AX300779.1 GI: 17382060 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Goldspink,G.D. and Terenghi , G . B . 

Repair of nerve damage 

Patent: WO 0185781-A 1 15-NOV-2001; 

University College London (GB) ; East Grinstead Medical Research 
Trust (GB) 

Location/Qualifiers 

1. .517 

/organism= "Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9606" 
<1. .333 

/not e=" unnamed protein product" 
/codon_start=l 
/protein_id="CAD1304 0. 1" 
/db_xref = "GI : 17382061" 

/ translat ion="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV 
DECCFRSCDLRRLEMYCAPLKPAKSARSVRAQRHTDMPKTQKYQPPSTNKNTKSQRRK 
GSTFEEHK" 
150 a 130 c 139 g 98 t 



Query Match 89.4%; Score 467.4; DB 6 

Best Local Similarity 96.2%; Pred. No. 1.3e-132 
Matches 501; Conservative 0; Mismatches 16 



Length 517; 

Indels 4; Gaps 2; 



QY 
Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

II MINIMUM MMIMMM MMMMMMI MMMMMM 

1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 



QY 
Db 



61 AGGGGCTTTTATTTOyiCAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

M. MMIMMMM MMIMMM 1 1 M 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 MM 

61 AGGGGCTTTTATTTCAAC^GCCC^C^GGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 12 0 



Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

IMIIIIMMMIIIIIIIIIIIIIIIII MINI Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II IM M I I II Ml 1 1 1 1 HIM IIIIMilllMIMIIIIIIIII 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCT 3 00 

lllllllllll llllllllllll I I! I Mill II 1 1 1 1 1 I lllllllll I 

Db 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCA G 2 97 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAA^CAAGTAGAGGGAGTGCAGGAAAOVAGAACTA 3 60 

I M M 1 1 1 1 1 1 Ml I II 1 1 1 1 1 1 1 1 1 III I II II Ml I II 1 1 1 1 1 1 II I II 1 1 ■ 

Db 2 98 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG CCAC CG CAGGACCCTTTG 42 0 

lllllllll 

Db 3 58 CAGGATGTA - GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCG CAGGACCCTTTG 416 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAA 480 

IIIIIIIIMIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIMIIIIIIMIIIIIIII 

Db 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCA 4 76 



Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521 

IIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 



RESULT 5 
AX147754 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



CDS 



AX147754 471 bp DNA linear PAT 08-JUN-2001 

Sequence 13 from Patent WO0136483. 

AX147754 

AX147754.1 GI:14348552 

Oryctolagus cuniculus (rabbit) 
Oryctolagus cuniculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

1 

Goldspink,G.R. and Johnson, I. R. 

Use of the insulin-like-growth factor i isoform mgf for the 
treatment of neurological disorders 
Patent: WO 0136483-A 13 25-MAY-2001; 
University College London (GB) 

Location/Qualif iers 

1. .471 

/organism^" Oryctolagus cuniculus" 
/mol_type= 11 genomic DNA" 
/db_xref ="taxon: 9986" 
<1. .318 

/note=" unnamed protein product" 
/codon_start=l 
/protein_id-"CAC41264 . 1" 
/db xref="GI : 14348553" 



BASE COUNT 
ORIGIN 



/ translat ion="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV 
DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKEVHLKNTSRGSAGNKN 
YRM " 

132 a 118 c 131 g 90 t 



Query Match 78.2%; Score 409; DB 6; Length 471; 

Best Local Similarity 90.1%; Pred. No. 1.3e-114; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 



1; 



Qy 

Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIMI 

1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

I II 1 1 1 II II 1 1 1 Ml II M 1 1 i I III 1 1 1 1 1 1 1 II 1 1 II II Ml III I M . 

61 AGGGGCTTTTATTTCAACAAGCCCACAGGATAC^ 12 0 

121 ACAGGCAT CGTGGATGAGTG CTGCTTCCGGAGCTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

Mill Mill MINIMI II MINIMI III III I II II I II I MINIM III 

121 ACAGGCATCGTGGATGAGTG CTGCTTCCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 
181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MMMIII Illllllll MIMMIM MIMMII MMIIIMM III 

181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

241 ATGCCCAAGACTCAGAAGTATCAG CCTCCATCTAC CAACAAGAAAATGAAGT CT CAGAGG 300 
IMMIMI III 

241 ATGCCCAAGACTCAG 255 

3 01 AGAAGGAAAGGAAGTACATTTGAAGAAC^CAAGTAGAGGGAGTGCAGG 360 

1 1 - 1 1 1 1 1 1 1 ; I 1 1 i 1 1 1 1 1 1 1 I E 1 1 [ I II! I [ i 1 1 1 1 1 1 1 i I 1 1 1 

256 AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308 

3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

MIMMII MMMIII. MIIIIMM MIMMII MMIIIMM III 

3 09 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG CCACCG CAGGACC CTTTG 368 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA^ 48 0 

IIIIIIIMIIIIIIIIIIMIIIIIMMMIIMMIINIMIIMIIIIIIIIIM 

3 6 9 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCQ^AAAAATAAGTTTGATQVCATTT 428 
481 CAAAGATGGCATTTCCC C CAATGAAATACACAAGTAAACATT C 523 

Mllllli MMIMIMI MIMMIM MIMIIM 

42 9 CAAAGATGG CATTTCCCC CAATGAAATACACAAGTAAACATT C 471 



RESULT 6 
AX300791 
LOCUS 



PAT 3 0-NOV-2 001 



AX300791 471 bp DNA linear 

DEFINITION Sequence 13 from Patent WO0185781. 
ACCESSION AX300791 

VERSION AX300791.1 GI : 17382072 

KEYWORDS 

SOURCE Oryctolagus cuniculus (rabbit) 

ORGANISM Oryctolagus cuniculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus . 
REFERENCE 1 

AUTHORS Goldspink,G.D. and Terenghi , G . B . 

TITLE Repair of nerve damage 

JOURNAL Patent: WO 0185781 -A 13 15-NOV-2001; 

University College London (GB) ; East Grinstead Medical Research 

Trust (GB) 

FEATURES Location/Qualifiers 
source 1. .471 

/organism= "Oryctolagus cuniculus" 

/mol_type= "genomic DNA" 

/db_xref="taxon: 9986" 
CDS <1. .318 

/not e=" unnamed protein product" 

/codon_start=l 

/protein_id="CAD13 04 5 .1" 

/db_xref="GI : 17382073" 

/ trans lation="GPETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIV 
DECCFRSCDLRRLEMYCAPLKPAKAARSVRAQRHTDMPKTQKEVHLKNTSRGSAGNKN 
YRM " 

BASE COUNT 132 a 118 c 131 g 90 t 

ORIGIN 

Query Match 78.2%; Score 409; DB 6; Length 471; 

Best Local Similarity 90.1%; Pred. No. 1.3e-114; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 
Qy 1 GGACCGGAGACG CT CTG CGGTG CTGAG CTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC^ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 61 AGGGGCTTTTATTTCAACAAGCCCAC^GGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

Db 121 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 181 TGTGC^CCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCC^GCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAAT^ 300 

I lllllll Mill 

Db 241 ATGCCCAAGACTCAG--- 255 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

IIIIIIMMIIIIIIIIMIII llllllllllllllllllllll MMIM 

Db 256 AAGGAAGTA(LATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

Db 309 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368 

Qy 421 CTCTG(JACAGTTACCTGTAAA(^TTGGAATACCGGCCAAAAAATAAGTTTGATC^(^TTT 480 

I I III 1 1 1 1 1 1 1 1 1 1 1 1 1 ! Ill III II I II I II I! I il 1 1 1 III 1 1 1 1 1 1 1 1 1 INI I 



Db 



36 9 CTCTGC^C^GTTACCTGTAAACATTGGAATACCGGCaWVAAATAAGTTTGATCAO^TTT 428 



Qy 481 CAAAGATGGC^TTTCCCCC^TGAAATAG^CAAGTAAACATTC 523 

I I I I I I I I I I I I I I I I I I I I I I I I M II I M I I II I I I I I I I I 

Db 42 9 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471 



RESULT 7 
BC012409 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC012409 1536 bp mRNA linear ROD 16-APR-2003 

Mus musculus insulin-like growth factor 1, mRNA (cDNA clone 
MGC: 18617 IMAGE:4194295) , complete cds . 
BC012409 

BC012409.1 GI:15214568 
MGC . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria,- Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 1536) 

Strausberg, R . L . , Feingold, E . A. , Grouse ,L.H., Derge,J.G., 
Klausner,R.D. , Collins , F. S . , Wagner, L., Shenmen , C . M . , Schuler,G.D. , 
, Zeeberg,B., Buetow,K.H., Schaef er, C. F . , Bhat,N.K., ' 
Jordan, H., Moore, T. , Max ,3.1., Wang, J., Hsieh,F., 
, Marusina,K., Farmer, A. A. , Rubin, G.M., Hong,L., 
Soares , M . B . , Bonaldo , M . F . , Casavant , T . L . , 
, Brownstein, M. J. , Usdin,T.B., Toshiyuki , S . , 
Prange,C., Raha,S.S., Loquellano , N . A. , Peters , G.J. ■ , 



Altschul,S.F. 
Hopkins , R . F . , 
Diatchenko, L. 
Stapleton,M. , 
Scheetz,T.E. , 
Carninci , P . 

Abramson,R.D. , Mullahy , S.J. , Bosak,S.A. , McEwan, P.J. , 
McKernan, K. J. , Malek, J. A. , Gunaratne, P.H. , Richards, S. , 
Worley,K.C, Hale,S., Garcia, A.M., Gay,L.J., Hulyk,S.W., 
Villalon,D.K. , Muzny,D.M. , Sodergren, E . J . , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteman,M., Madan,A. , Rodrigues , S . , 
Sanchez, A., Whiting,M., Madan,A. , Young, A. C. , Shevchenko , Y . , 
Bouf fard,G.G. , Blakesley, R . W . , Touchman , J . W . , Green, E . D . , 
Dickson, M.C. , Rodriguez , A. C. , Grimwood,J., Schmutz, J.., Myers ,R.M., 
Butterf ield, Y. S. , Krzywinski , M. I . , Skalska,U. , Smailus,D.E. , 
Schnerch,A. , Schein,J.E., Jones,S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 

22388257 

12477932 

2 (bases 1 to 1536) 
Strausberg, R. 
Direct Submission 

Submitted ( 15-AUG-2001) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL : http : //mgc . nci . nih . gov 

Contact : MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Jeffrey E. Green, M.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Baylor College of Medicine Human Genome 



Sequencing Center 
Center code: BCM-HGSC 

Web site : http : / /www.hgsc . bcm. tmc . edu/ cdna/ 
Contact : amg@bcm . tmc . edu 

Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H . , 
Kowis, C.R., Sneed, A.J. , Martin, R.G., Muzny, D.M., Nanavati, 
A.N. , Gibbs, R.A. 



FEATURES 

source 



gene 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAK' Plate: 24 Row: k Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 6754307. 

Location/Qualif iers 

1. .1536 

/ organ ism="Mus musculus" 
/mol_type= M mRNA" 
/strain="FVB/N n 
/dbjxref ="taxon: 10090" 
/clone="MGC: 18617 IMAGE : 4194295" 

/tissue_type= "Liver, normal. 5 month old male mouse." 

/clone_l ib= "NCI_CGAP_Li 9 " 

/lab_host="DH10B" 

/note=" Vector: pCMV-SP0RT6" 

1. .1536 

/gene="Igf 1" 

/db_xref="LocusID: 16000" 
/db_xref="MGI : 96432" 
28. .429 
/codon__start = l 

/product^" insulin-like growth factor 1" 
/protein_id="AAH12409 .1" 
/db_xref="GI : 15214569" 
/db_xref=" Locus ID: 16000" 

/ 1 rans 1 a t ion= " MSS SHLFYLALCLLTFTSSTTAGPETLCGAELVDALQF VCGPRG 
FYFNKPTGYGSS I RRAPQTGI VDECCFRSCDLRRLEMYCAPLKPTKAARS I RAQRHTD 
MPKTQKSPSLSTNKKTKLQRRRKGSTFEEHK" 
485 a 324 c 303 g 424 t 



Query Match 69.8%; Score 364.8; DB 10; Length 1536; 

Best Local Similarity 83.2%; Pred. No. 6.6e-101; 

Matches 441; Conservative 0; Mismatches 82; Indels 7; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill MM II M I Ml M I M I II M 1 1 1 1 1 1 M 1 1 

Db 94 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 153 



Qy 61 AGGGGCTTTTATTTCAAC^GCCCACAGGATACGGCTCC^GC^GTCGGAGGGCACCTCAG 12 0 

1 1 [ 1 1 1 1 i 1 1 1 lllllllllllllllll II llllllllll M II 1 1 1 1 1 III II 1 1 

Db 154 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 213 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I llllllllllll 

Db 214 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 273 



Qy 



181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 



Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



Mill II II Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 It MIMMMIMIIIMM Ml 

2 74 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTC^^ 333 
241 ATGCCCAAGACTt^GAAGTATC^GCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

1 1 i 1 1 1: I II I I I I III II 1 1 1 1 1 1 1 1 1 1 MM M III 

334 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGC 393 

3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAG 360 

MIMMMMMIMIMMIMIMMIIMMIMI IMMMMMIMM III 

3 94 AGAAGGAAAGGAAGTACATTTGAAGAACAC^GTAGAGGAAGTGCAGGAAACAAGACCTA 4 53 

361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

III llhl II III I MM I II I II Ml MM Mill 

4 54 CAGAATGTAGGAGGAG C CT CCCACGGAGCAGAAAATG CCACAT CAC CG CAGGATCCTTTG 513 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 4 74 

II I MUM I II III III II MM III II 

514 CTGCTTGAGCAACCTGCAAAACATCGAAACA.CCT 573 
475 ACATTTCAAAGAT - GGCATTTCCGCCAATGAAATACACAAGTAAACATTC 523 



Mill MM 



Mil 1 1 1 II 1 1 1 1 Ml Mill I III 



574 ACATTACAAAGATGGGCATTTCCCCCAATGAAATATACAAGTAAACATTC 623 



RESULT 8 

RNIGFI2 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 

FEATURES 

source 



RNIGFI2 798 bp mRNA linear ROD 12-SEP-1993 

Rat mRNA (clone IGF1AB2) for insulin-like growth factor I. 

X06108 M32339 Y00429 

X06108.1 GI: 56426 

insulin- like growth factor I. 

Rattus norvegicus (Norway rat) 

Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

1 . 

Shimatsu,A. and Rotwein,P. 

Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing 
Within the 5' Untranslated Region 
Nucleic Acids Res. 15 (1987) In press 

2 (bases 1 to 798) 
Rotwein,P. 

Direct Submission 

Submitted (21-OCT-1987) Rotwein P., Washington University, School 
of Medicine, 660 South Euclid Avenue, Box 8127, St. Louis, MO 
63110, USA 

Another IGF-I mRNA of rat liver differing in the 5' UT-region is 
described in <X06107>. 

Loca t ion/ Qua 1 i f i er s 

1. .798 

/ organ ism=" Rattus norvegicus" 
/mol__type= "mRNA" 
/strain=" Sprague-Dawley" 
/db_xref="taxon: 10116" 
/ 1 i s sue_type= " 1 iver " 



misc feature 



1. .72 



CDS 



misc feature 



BASE COUNT 
ORIGIN 



238 a 



/note="5" UT-region" 
73. .552 

/note="IGF-I (AA 1-159)" 
/codon_start=l 
/protein_id="CAA294 81 . 1" 
/db_xref="GI : 56427" 

/ 1 rans 1 a t ion= " MGKI SSLPTQLFKI CLCDFLKI KI HI MSSSHLFYLALCLLTFTS 
SATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSS I RRAPQTGI VDECCFRSCDL 
RRLEMYCAPLKPTKSARS I RAQRHTDMPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK" 
553. .798 

/note="3 f UT- region 11 

196 c 183 g 181 t 



Query Match 69.1%; Score 361.6; DB 10; Length 798; 

Best Local Similarity 82.8%; Pred. No. 5.8e-100; 

Matches 439; Conservative 0; Mismatches 84; Indel.s 7; 



Gaps 



2; 



Qy . 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill MM III llll 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

217 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 276 



QY 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



61 AGGGGCTTTTATTTCAACAAGCCC^CAGGATACGGCTC 12 0 

MM, MM I ill 1 1 1 U l i i 1 1 II IIMIIIIII IMMIMMM II! 

277 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATC 336 
121 ACAGG CATCGTGGATGAGTGCTG CTTC CGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II 1 1 1 1 1 IMMMIMI MMMMMMMMMMIMMMMMIMIMM 

337 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 3 96 
181 TGTGC^CCCCTCAAGCCGGC^^GGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Mill II II Mill Mill MM II III llll I 1 1 1 1 1 ! 1 1 1 ! I III 

3 97 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCaVCACTGAC 4 56 
241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATC 3 00 

1 1 1 1 1 1 M I ! 1 1 M 1 1 1 II Mill I III II 1 1 1 1 1 1 1 1 1 MM II Ml 

457 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGAC^CA^ 516 
301 AGAAGGAAAGGAAGTA(^TTTGAAGAACA.CAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Ml I I i I i I llllllllllllllllllll llllllllllllllll III 

517 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 576 
361 CAGGATGTAGGAAGACCCTT CTGAGGAGTGAAGAAGGACAGG CCACCG CAGGACCCTTTG 42 0 

I I I 1 1 1 II I II 1 1 II I I 1 1 1 II I 1 1 I II I 1 1 1 1 1 1 1 M II I M I 

577 C^GAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 636 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 4 74 

M I MMM MM Ml llll II III llll II 

637 CTGCTTGAGCAACCTGCAAAACATCGGAACACCT^ 696 

4 75 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

III MM IMIIIIIII M II 1 1 1 1 1 1 1 MM I II III 

697 CCATTTCAGAGATGGG(^TTTCCCT(^TGAAATA(^CAAGTAAACATTC 746 



RESULT 9 
RNIGFI1 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 

FEATURES 

source 



misc feature 



CDS 



misc feature 



BASE COUNT 
ORIGIN 



RNIGFI1 958 bp mRNA linear ROD 12-SEP-1993 

Rat mRNA (clone IGF1AB1) for insulin-like growth factor I. 

X06107 M32260 Y00429 

X06107.1 GI: 56424 

insulin- like growth factor I. 

Rattus norvegicus (Norway rat) 

Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus . 
1 

Shimatsu,A. and Rotwein, P. 

Sequence of two rat insulin- like growth factor I mRNAs differing 

within the 5 ' untranslated region 

Nucleic Acids Res. 15 (17), 7196 (1987) 

88015572 

3658684 

2 (bases 1 to 958) 
Rotwein, P . 
Direct Submission 

Submitted (21-OCT-1987 ) Rotwein P., Washington University, School 
of Medicine, 660 South Euclid Avenue, Box 8127, St. Louis, MO 
63110, USA 

Another IGF-I mRNA of rat liver differing in the 5' UT-region is 
described in <X06108>. 

Location/Qualif iers 

1. .958 

/organism^" Rattus norvegicus" 
/ mo 1 _ t ype = " mRNA " 
/strain="S rpague - Da wl ey " 
/db_xref="taxon: 10116" 
/ 1 issue_type= " 1 iver " 
1. .258 

/note="5* UT-region" 
259. .738 

/note="IGF-I (AA 1-159)" 
/codon_start=l 
/protein_id="CAA29480. 1" 
/db_xref="GI : 56425" 
/db_xref = " SWI SS - PROT : P08 024 " 

/ translat ion= " MGKISSLPTQLFKI CLCDFLKIKIHIMSSSHLFYLALCLLTFTS 
SATAGPETLCGAELVDALQFVCGPRGFYFNKPTGYGSS I RRAPQTG I VDECCFRS CDL 
RRLEMYCAPLKPTKSARS I RAQRHTDMPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK" 
736. .958 

/note="3 ! UT-region" 
260 a 246 c 209 g 243 t 



Query Match 69.1%; Score 361.6; DB 10; Length 958; 

Best Local Similarity 82.8%; Pred. No. 6e-100; 

Matches 439; Conservative 0; Mismatches 84; Indels 7; Gaps 



2; 



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II ill, M MINIMI MIMMMM MINIM 

4 03 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 462 



Qy 



61 AGGGGCTTTTATTTCAAC^GCCC^CAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 



minimi m m imim n n mimiii iimiiimi m 

Db 463 AGGGGCTTTTACTTCAAC^GCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCACAG 522 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II Mill III I II i 1 1 Ml 1 1 1 III II ! 1 1 1 II IM 1 1 i li I II II 

Db 523 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 582 

Qy 181 TGTGCACCCCTCAAGCCGGC^^GGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Mill M III III I Mill MINI III MM Mill I MIMIII III 

Db 583 TGTGCTCCGCTGAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 642 

Qy 241 ATGCCCAAGACTCAGAAGTAT^GCCTCOVTCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

Mill II Mill MM III Mill I III II II II 1 1 1 II MM II III 

Db 643 ATGCCCAAGACT CAGAAGTC CCAG C CCCTATCGACACACAAGAAAAGGAAGCTG CAAAGG 7 02 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACAOVAGTAGAGGGAGTGC^GGAAAa^GAACTA 360 

! I M 1 1 1 II II I ! Ill 1 1 IIIIIIIMIIIII Mill Ml Mill llllll III 

Db 703 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 762 

Qy 361 CAGGATGTAGGAAGACC CTTCTGAGGAGTGAAGAAGGACAGGC CACCGCAGGAC C CTTTG 420 

III MIMIII II III I Mill I II I II I IMMII II IMIM 

Db 763 C^GAATGTAGGAGGAGCGTCCCGAGGAACAGAAAATGCC^CGT(^CCGC^AGATCCTTTG 822 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II I llllll MM III MM II III MM II 

Db 823 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATAT^^ 882 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

Mill 1 1 MM MM I Mill MIMIII III II I II I II III III 

Db 883 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 932 



RESULT 10 

RATIGFIA 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



RATIGFIA 710 bp mRNA linear ROD 27-APR-1993 

Rat insulin-like growth factor I (IGF-I) mRNA, complete cds . 
M15480 

M15480.1 GI :204749 

growth factor; insulin-like growth factor. 
Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Craniata ; Vertebrata ; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; 



Chordata ; 
Rodent i a; 



Lowe , W . L . Jr . , Seaman , W . T . and 



Eukaryota; Metazoa; 
Mammalia; Eutheria; 
Rattus . 

1 (bases 1 to 710) 
Roberts , C.T. Jr., Lasky,S.R. 
LeRoith,D. 

Molecular cloning of rat insulin- like growth factor I complementary 

deoxyribonucleic acids: differential messenger ribonucleic acid 

processing and regulation by growth hormone in extrahepatic tissues 

Mol. Endocrinol. 1 (3), 243-248 (1987) 

88288198 

3453891 

Original source text: Rat (Sprague-Dawley) adult liver cDNA to 
mRNA, clone pRIGF-1-42. 

Draft entry and computer -readable copy of sequence in [Mol. 
Endocrinol. (1987) In press] kindly 



provided by S.R.Lasky, 16-MAR-1987. 
FEATURES Loca t ion/ Qual i f iers 

source 1. .710 

/organism="Rattus norvegicus" 

/mol_type= " mRNA" 

/db_xref ="taxon: 10116" 
CDS - 106. .507 

/note=" insulin-like growth factor precursor" 

/codon_start=l 

/protein_id="AAA41385 . 1" 

/db_xref ="GI : 204750" 

/translation="MSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQFVCGPRG 

FYFNKPTGYGSS I RRAPQTGI VDECCFRSCDLRRLEMYCVRCKPTKSARS I RAQRHTD 

MPKTQKSQPLSTHKKRKLQRRRKGSTLEEHK" 
sig_peptide 106. .171 

/note=" insulin-like growth factor signal peptide" 
mat_peptide 172. .258 

/product =" insulin-like growth factor B peptide" 
matjpeptide 259. .294 

/product^" insulin-like growth factor C peptide" 
mat_peptide 295. .357 

/product^" insulin-like growth factor A peptide" 
mat_peptide 358. .381 

/product="insulin-like growth factor D peptide" 
matjpeptide 382. .504 

/product^" insulin-like growth factor E peptide" 
BASE COUNT 202 a 191 c 172 g 145 t 

ORIGIN 1 bp upstream of EcoRI site. 

Query Match 68.5%; Score 358.4; DB 10; Length 710; 

Best Local Similarity 82.5%; Pred. No. 5.5e-99; 

Matches 43 7; Conservative 0; Mismatches 86; Indels 7; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill II I! I II Mill II II Ml IN: I MM III MUM II Mill 

Db 172 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 231 

Qy 61 AGGGGCTTTTATTTCAA(^GCC(^(^GGATACGGCTCCAG(^GTCGGAGGGCACCTCAG 120 

Mil IIIIIIIIIMIIIIM M IMIMMII MMIMIMM III 

Db 232 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGC^ 2 91 

Qy 121 ACAGGCAT CGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 180 

II Mill MINIM MMIIIMI IMIIMIN MMIIIMI II 

Db 2 92 ACGGGCATTGTGGATGAGTGTTG CTT CCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 351 

Qy 181 TGTGC^CCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MM II MUM Mill MM II III MM MIMMMIMM III 

Db 352 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 411 

Qy 241 ATG CCCAAGACTCAGAAGTATCAG C CTCCAT CTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

MMIIIMI Illllllll Mill I III M II II I II 1 1 MM II III 

Db 412 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACAC^ 4 71 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAAC^C^GTAGAGGGAGTGC^GGAAAC^GAACTA 360 

MM 1 1 1 1 1 1 1 MM M I 1 1 1 1 i 1 1 1 1 [ 1 1 i I J 1 1 III MMIIMII Ml 

Db 472 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAAC^GACCTA 531 



Qy 
Db 

Qy 
Db 

Qy 
Db 



3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

III I III III I Mill I Mill I II I II I I III II I II llllll 

532 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 5 91 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 4 74 

I! I llllll MM III MM II Ml MM II 

5 92 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTG 651 
475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

MM MM Mill Mill MMMIMM I II MMMIMM 

652 CCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 701 



RESULT 11 

AX147744 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



AX147744 
Sequence 3 
AX147744 
. AX147744 . 1 



CDS 



53 9 bp 
from Patent WO0136483. 

GI : 14346789 



DNA 



linear 



PAT 31-AUG-2 001 



BASE COUNT 
ORIGIN 



Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus. 
1 

Goldspink,G.R. and Johnson, I. R. 

Use of the insulin-like-growth factor i isoform mgf for the 
treatment of neurological disorders 
Patent: WO 0136483-A 3 25-MAY-2001; 
University College London (GB) ^ 

Location/Qualifiers 

1. .539 

/organism= "Rattus norvegicus" 
/mol_type= "genomic DNA" 
/db_xref="taxon: 10116" 
<1. -336 

/not e= "unnamed protein product" 
/codon_start=l 
/protein_id= "CAC4 1176 . 1 " 
/db_xref ="GI : 14346790" 
/db_xr e f = " REMTREMBL : CAC4 1176" 

/ translat ion="GPETLCGAELVDALQFVCGPRGFYFNKPTVYGSSIRRAPQTGIV 
DECCFRSCDLRRLEMYCVRCKPTKSARSIRAQRHTDMPKTQKSQPLSTHKKRKLQRRR 
KGSTLEEHK" 
161 a 136 c 139 g 103 t 



Query Match 68.2%; Score 356.8; DB 6; Length 53 9; 

Best Local Similarity 82.3%; Pred. No. 1.6e-98; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill MMMIMM 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II I 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 



Qy 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGAT^ 12 0 



Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



. IMIIIIIIII IIIIIIIIMIIIIII II MINIM MINIM Ml 

61 AGGGGCTTTTACTTCAACAkGCCC^ 12 0 

121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill INI IN 1 1 1 1 III I ! I NNI 1 1 1 II 1 1 1 1 N N 1 1 1 1 1 1 1 

121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 
181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACAGCGAC 24 0 

MM II MM 1 1 Mill MM II III MM MM I III MM II III 

181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 
241 ATG C CCAAGACT CAGAAGTATCAGC CTC CATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 0 0 

IIIIIIMIIMIIIIMI Mill I Ml II MIMIMI MM II III 

241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 300 
3 01 AGAAGGAAAGGAAGTAC^TTTGAAGAACAC^AGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

IIMMMMMIIMM 1 1 1 1 1 1 i 1 1 1 E 1 1 1 1 1 1 1 1 1 1111,111111 Ml 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 
361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

III IMIIMI II III I Mill I II I II I IIIMM II NNI 

361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 
421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II I INN MM Ml MM II III MM II 

421 CTG CTTGAG CAACCTG CAAAACAT CGGAACAC CTGCCAAATATCAATAATGAGTT CAATA 48 0 

4 75 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

IIIMM MM IMMMIII II I II N N I i 1 1 1 1 1 N N I 

481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530 



RESULT 12 

AX300781 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



AX300781 539 bp DNA linear PAT 30-NOV-2001 

Sequence 3 from Patent WO0185781. 

AX300781 

AX300781.1 GI:17382062 

Rattus sp. 
Rattus sp. 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; 
Rattus . 
1 

Goldspink,G.D. and Terenghi , G . B . 

Repair of nerve damage 

Patent: WO 0185781-A 3 15-NOV-2001 ; 

University College London (GB) ; East Grinstead Medical Research 
Trust (GB) 

Locat ion/Qual i f iers 

1. .539 

/organism^ "Rattus sp." 
/mo l_type= "genomic DNA" 
/db_xref="taxon: 10118" 
<1. .336 

/not e= "unnamed protein product" 



BASE COUNT 
ORIGIN 



/codon_start=l 
/protein_id="CAD13.041 . 1" 
/db_xref="GI : 17382063" 

/translation="GPETLCGAELVDALQFVCGPRGFYFNKPTVYGSSIRRAPQTGIV 
DECCFRSCDLRRLEMYCVRCKPTKSARS I RAQRHTDMPKTQKSQPLSTHKKRKLQRRR 
KGSTLEEHK" 
161 a 136 c 139 g 103 t 



Query Match 68.2%; Score 356.8; DB 6; Length 539; 

Best Local Similarity 82.3%; Pred. No. 1.6e-98; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; 



Gaps 



2; 



Qy 
Db 



1 GGAC CGGAGACGCT CTG CGGTG CTGAG CTGGTGGATG CT CTTCAGTTCGTGTGTGGAGAC 60 

II II Mill II Mill I ! 1 1 1 1 1 1 1 1 1 I Ml IMIIUIIII Mill 

1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 



Qy 

Db 

Qy 

Db 

Qy 
Db 



61 AGGGGCTTTTATTTC^CAAGCCC^CAGGATACGGCTCC^GCAGTCGGAGGGCACCTCAG 12 0 

MIMMIMI MMMMMMIMI II MIIIIMM 

61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGC^TTCGGAGGGCACCACAG 12 0 
121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill Illllllllll MMMMMIMMMMMMMIIMMIMIMM 

121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 
181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCC^GCGCC^CACCGAC 24 0 

MM I I llllll Mill MM II III MM MIIIIMM MM III 

181 TGTGTCCGCTGCAAGCCTACAAAGTaVGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 



Qy 

Db 

Qy 
Db 

Qy 
Db 



241 ATG CC CAAGACTCAGAAGTATCAGCCTCCAT CTACCAACAAGAAAATGAAGTCTCAGAGG 3 0 0 

IIIIIIIIIIIIIIMIM Mill I III II 1 1 1 1 1 1 1 i i MM II III 

241 ATG CCCAAGACT CAGAAGTC C(^G CCC CTATCGAC^CACAAGAAAAG^ CAAAGG 3 0 0 

301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAAC^ 360 

l h i I I, I, IJIMIIIIIIIIIMIIII llllllllllllllll III 

3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 360 
361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

III IMIIIII II III I Mill 

361 CAGAATGTAGGAGGAGC CTC C CGAGGAACAGAAAATGCCACGTCACCGCAAGATC CTTTG 42 0 



Qy 

Db 

Qy 

Db 



421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II I llllll MM III MM II III MM II 

421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAA^ 4 8 0 

475 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

I MM MM III II I I III III II II MM II MM MM 

481 TCATTTCAGAGATGGGCATTTCCCTC^TGAAATACACAAGTAAACATTC 53 0 



RESULT 13 

MMIGFIBR 

LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 



MMIGFIBR 651 bp mRNA linear ROD 21-MAR-1995 

Mouse mRNA for preproinsul in-like growth factor IB. 

X04482 

X04482.1 GI:51806 

growth factor; insulin-like growth factor IB; preproinsul in- like 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



misc feature 



CDS 



sigjpeptide 
mat_pept ide 
misc_f eature 
polyA_site 



BASE COUNT 
ORIGIN 



growth factor IB; signal peptide. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 651) 

Bell, G. I., Stempien,M.M. , Fong , N . M . and Rall,L.B. 

Sequences of liver cDNAs encoding two different mouse insulin- like 
growth factor I precursors 

Nucleic Acids Res. 14 (20), 7873-7882 (1986) 

87040760 

3774549 

The sequence is identical to the preproIGF-IA sequence (X04480) 
except for the presence of a 52 bp insertion following codon 8 6 
(position 397 to 448) , caused by alternative RNA splicing. The B 
domain of IGF comprises residues 1-29 (position 139-225) , the C 
domain residues 30-41 (position 226-261) , the A domain residues 
42-62 (position 262-324) and the D domain residues 63-70 (position 
325-348) . 

Location/Qualifiers 
1. .651 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="Balb/c" 
/db_xref = M taxon: 10090" 
/clone="migf 1-4 " 
/tissue_type=" liver" 
43. .45 

/note="pot. translation start" 
73. .474 

/note="prepro IGF-IB (aa -22 to 111) 11 
/codon_start=l 
/prot ein_id= " CAA2 8170.1" 
/db_xref="GI : 518 07" 
/db_xref= M MGI : 96432" 
/ db_xr e f = " SW I S S - PROT : P 0 5 0 1 8 " 

/translat ion="MSSSHLFYLALCLLTFTSSTTAGPETLCGAELVDALQFVCGPRG 
FYFNKPTGYGSS I RRAPQTGI VDECCFRSCDLRRLEMYCAPLKPTKAARS I RAQRHTD 
MPKTQKSPSLSTNKKTKLQRRRKGSTFEEHK" 
73. .138 

/note="signal peptide (aa -22 to -1) " 
139. .348 

/product = "mature IGF-IB (aa 1-70)" 
349. .471 

/note="COOH-terminal peptide (E domain) (aa 71 to 111) 11 
651 

/note="polyA site" 

185 c 149 g 124 t 



193 a 



Query Match 66.8%; Score 34 9.4; DB 10; Length 651; 

Best Local Similarity 82.8%; Pred. No. 3. 2e-96; 

Matches 425; Conservative 0; Mismatches 81; Indels 7; Gaps 



2; 



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill M II II I II IMI I MM IMI M I Ml II I 

139 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198 



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAG 12 0 

i : 1 1 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 II llllllllll IIIIIIIIMIIIIII 

Db 199 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

IMIIIII lllllllllll Ml ! I Ml I I MM llllllllllll 

Db 259 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 318 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MM! II II Mill MM ! MINIM 1 1 1 1 i 1 1 1 1 1 1 1 1 1 M 1 1 1 III 

Db 319 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 378 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

MMMMMMMIMM M I I Ml II llllllllll MM II III 

Db 3 79 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 438 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

II Mill Mill I llllll I MMMMMMMIMM IIIIIMIIIIMIII III 

Db 43 9 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 4 98 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

III Illlllll II III I MM llllll MIMMIII MUM 

Db 4 99 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 558 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II ! llllll I II III III II lllllll II 

Db 559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACa\AATAACAATA^ 618 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAA 506 

Mill lllllll I 'M I i I I I I 

Db 619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651 



RESULT 14 
AF022961 

LOCUS AF022961 432 bp mRNA linear MAM 14-OCT-1997 

DEFINITION Oryctolagus cuniculus insulin-like growth factor IB (IGF-IB) mRNA, 

complete cds . 
ACCESSION AF022961 

VERSION AF022961.1 GI: 2522201 

KEYWORDS 

SOURCE Oryctolagus cuniculus (rabbit) 

ORGANISM Oryctolagus cuniculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 
REFERENCE 1 (bases 1 to 432) 

AUTHORS Flekna,G., Brem,G. and Mueller, M. 
TITLE Direct Submission 

JOURNAL Submitted (05-SEP-1997) Institute of Animal Breeding and Genetics, 
Veterinary University of Vienna, Veterinaerplatz 1, Vienna A-1210, 
Austria 

FEATURES Location/Qualifiers 
source 1. .432 

/organism^ "Oryctolagus cuniculus" 
/ mo 1 _ t yp e = " mRNA " 
/strain="ZIKA hybrid strain" 



gene 
CDS 



BASE COUNT 
ORIGIN 



/db__xref="taxon: 9986" 

/eel l_type= "primary liver cells" 

/tissue__type= H liver" 

1. .432 

/gene="IGF-IB" 
1. .432 

/gene="IGF-IB" 
/ codon_s t a r t = 1 

/product=" insulin-like growth factor IB" 
/protein_id="AAB80950 . 1" 
/db_xref= n GI : 2522202" 

/ 1 r ans lation-"MI TPTVKMR I LS S SHLF YLALCLLTFTSSATAGPETLCGAELVD 
ALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEMYCAPLKPAKAA 
RSVRAQRHTDMPKTQKYQPPSTNKKMKSQRRRKGSTFEEHK" 
105 a 126 c 115 g 86 t 



Query Match 64 .2%; 

Best Local Similarity 100.03 
Matches 33 6; Conservative 



Score 336; DB 4; Length 432; 
Pred. No. 4.1e-92; 
0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 

Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

MMM IMMIM MIMMII IMIMM IMIMM IMMIIM IMM 

97 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 156 



Qy 



Db 



61 AGGGGCTTTTATTTCTUVt^GCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

Mill 11111111' MMIiirillllliM illllir MINI III Mill 

157 AGGGGCTTTTATTTO^CAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 216 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

MIIIIIIIIIIIIIMIIIIMIIIIIIIIMIIIIIIIIIIIIMMIIIIIIIIIII 

217 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 276 
181 TGTGCA.CCCCTC^GCCGGCAAAGGCA.GCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

IMIIIIIMMIIIIIIIIMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIII 

2 77 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 336 
241 ATGCCCAAGACTCAGAAGTATCAGCCTCC^TCTACCAACAAGAA 3 00 

III. hllMII.IIIMIIII IIIIMII IIIIIIIIMIMIIM llllll 

337 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAAC^ 396 

3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAG 336 

MM MIMMII MMMIM IMIMM I 

3 97 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAG 432 



RESULT 15 

AX375028 

LOCUS 

DEFINITION 
ACCESSION 
VERSION t 
KEYWORDS 
SOURCE 

ORGANISM 



AX375028 7260 bp ; 

Sequence 31 from Patent WO0210436. 
AX375028 

AX375028 .1 GI : 19169860 



(human) 



DNA 



linear 



PAT 01-MAR-2002 



Homo sapiens 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



BASE COUNT 
ORIGIN 



Baak,J. and Mutter , G . L . 

Prognostic classification of breast cancer 

Patent: WO 0210436-A 31 07-FEB-2002; 

THE BRIGHAM AND WOMEN'S HOSPITAL, INC. (US) 

Location/Qualif iers 

1. .7260 

/organism="Homo sapiens" 
/mol_type= "genomic DNA" 
/db_xref="taxon: 9606" 
2330 a 1415 c 1240 g 2275 t 



Baak # Jan (US) 



Query Match 63.9%; Score 334.4; DB 6; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 2e-91; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 



4; 



QY 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I 

311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



61 AGGGGCTTTTATTTO^CAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

IIMIIIMIIIIIMIIIIIIIMIMI II MMMMMMMIIMM II I II I 

371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 
121 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

I IN 1 1 Ml Ml 1 1 1 ll 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 ! 1 1 1 1 1 1 1 1 i 1 1 1 1 

431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 4 90 
181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCaVCACCGAC 24 0 

II llllllllllllll 1 1 III 1 1 1 1 Mill M . 1 1 1 1 1 MM 1 1 1 II 1 1 1 1 

4 91 TGCGCACCCCTOyVGCCTGC(^GT(^GCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 55 0 
241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAAC 300 

IIMIMIII III 

551 ATGCCCAAGACCCAG 565 

3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I 
566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

lllllllllllllllllll I Mill I II II Ml MM 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

619 CAGGATGTAGGAAGAC C CT C CTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 
421 CTCTGCAC-AGTTACCTG - TAAACA.TTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 4 78 

hi. I Mi Mill II III hi I I M I I 1 1 1 1 

679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738 
479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II MINI III III IIIIIIMIIIIIIIIMI I IIIIMM 

73 9 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 



Search completed: December 13, 2003, 09:27:34 
Job time : 2336.77 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



December 13, 2003, 02:35:18 ; Search time 209.995 Seconds 

(without alignments) 
6723.048 Million cell updates/sec 



Title: US-09-852-261-5 
Perfect score: 523 

Sequence: 1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523 

Scoring table: I DENT I T Y_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



2552756 seqs, 1349719017 residues 



5105512 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -process ing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : NJ3eneseq_19 Jun03 : * 

1 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1980 .DAT: * 

2 : - /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:* 

3 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1982 .DAT:* 

• 4 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1983 .DAT: * 

5 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1984 .DAT: * 

6 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1985 .DAT: * 

7 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 6 .DAT: * 

8 : /SIDSl/gcgdata/geneseq/ geneseqn-embl/NA1987 . DAT : * 

9 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1988 .DAT: * 

10 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1989 .DAT: * 

11 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1990.DAT: * 

12 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1991 .DAT: * 

13 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1992 .DAT: * 

14 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1993 .DAT: * 

15 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1994 .DAT: * 

16 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1995 .DAT: * 

17 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1996 .DAT: * 

18 : /SIDSl/gcgdata/geneseq/geneseqn-embl'/NA1997 .DAT: * 

19 : / S I DS 1 / gcgda t a / genes eq/ genes eqn- embl / NA1 998. DAT : * 

20 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1999 .DAT: * 

21 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2 000.DAT: * 

22 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT: * 

23 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT: * 

24 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2 002 . DAT: * 

25 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2003 .DAT: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 



and is derived by analysis of the total score distribution. 

SUMMARIES 

o 

"O 

Result Query 



ro. 


Score 


Match Length 


DB 


ID 


Description 


1 


523 


100 . 


0 


523 


22 


AAD064 0 0 


Rabbit IGF- I isofo 


2 


523 


100 . 


.0 


523 


24 


AAS16879 


Rabbit mechano-gro 


3 


523 


100 . 


0 


553 


18 


AAT84893 


Rabbit insulin lik 


4 


467 . 


4 


89 . 


4 


517 


22 


AAD06398 


Human IGF- I isofor 


5 


467. 


4 


89 . 


4 


517 


24 


AAS16877 


Human mechano-grow 


6 


409 


78 . 


2 


471 


22 


AAD06405 


Rabbit liver- type 


7 


409 


78 . 


2 


471 


24 


AAS16884 


Rabbit insulin-lik 


8 


356. 


8 


68 . 


2 


539 


22 


AAD06399 


Rat IGF-I isoform 


9 


356. 


8 


68 . 


2 


539 


24 


AAS16878 


Rat mechano- growth 


10 


349. 


,4 


66 . 


8 


651 


25 


ABV76185 


Mouse insulin-like 


11 


334 . 


,4 


63 . 


9 


818 


8 


AAN70436 


Sequence encoding 


12 


334. 


.4 


63 . 


9 


7260 


24 


ABT11091 


Human breast cance 


13 


334 . 


.4 


63 . 


, 9 


7260 


24 


ABK84 583 


Human cDNA differe 


14 


334 . 


.4 


63 . 


9 


7260 


24 


ABN97244 


Gene #3742 used to 


15 


334 . 


.4 


63 . 


, 9 


7260 


24 


ABK64 812 


Human benign prost 


16 


334 . 


.4 


63 . 


, 9 


7260 


24 


ABK35504 


Human endometrial 


17 


334 . 


.4 


63 . 


. 9 


7260 


24 


ABK35561 


Gene IGF1 differen 


18 


332 . 


.8 


63 . 


. 6 


777 


18 


AAT84 8 94 


Human insulin like 


19 


331. 


.2 


63 . 


. 3 


622 


7 


AAN60490 


Human prepro-somat 


20 


281 . 


.8 


53 . 


. 9 


978 


14 


AAQ47804 


Sequence encoding 


21 


275 . 


.2 


52 . 


. 6 


1136 


8 


AAN70435 


Sequence encoding 


22 


274 . 


.6 


52 . 


. 5 


3599 


19 


AAV50428 


Plasmid pIG0552 lo 


23 


274 , 


.6 


52 , 


. 5 


3599 


19 


AAV4 07 96 


Actual sequence of 


24 


274 , 


.6 


52 , 


. 5 


3600 


19 


AAV50427 


Plasmid pIG0552 up 


25 


274 


.6 


52 , 


. 5 


3600 


19 


AAV40795 


Expected sequence 


26 


274 


.6 


52 , 


. 5 


5707 


20 


AAX88055 


Plasmid pIG0335 DN 


27 


274 


.6 


52 ; 


. 5 


6345 


20 


AAX88054 


Plasmid pIGOlOOA D 


28 


273 


.6 


52 . 


. 3 


612 


22 


AAS14695 


Human cDNA encodin 


29 


273 


.6 


52 


. 3 


612 


25 


ABZ83309 


Toxicologically re 


30 


266 


.8 


51 


. 0 


1052 


20 


AAX27498 


Rat liver form of 


31 


262 


50 


. l 


487 


22 


AAD06404 


Rat liver- type IGF 


32 


262 


50 


. 1 


487 


24 


AAS16883 


Rat insulin- like g 


33 


250 


47 


.8 


671 


24 


ABT09479 


Phase -1 Rat CT gen 


34 


237 


.6 


45 


.4 


317 


24 


AAS16882 


Human insulin- like 


35 


237 


.6 


45 


. 4 


318 


22 


AAD06403 


Human liver -type I 


36 


237 


.6 


45 


.4 


462 


19 


AAV50426 


Human IGF-1 encodi 


37 


237 


.6 


45 


.4 


462 


19 


AAV4 0794 


Human IGF-I coding 


38 


237 


.6 


45 


.4 


462 


24 


ABZ35734 


Human IGF1 polynuc 


39 


237 


.6 


45 


.4 


462 


24 


ABX09977 


Human IGF1 DMA fra 


40 


237 


. 6 


45 


.4 


462 


24 


ABV78158 


Human IGF1 DNA SEQ 


41 


237 


.6 


45 


.4 


462 


24 


ABL91699 


Human polynucleoti 


42 


209 


40 


.0 


286 


25 


ABV76186 


Mouse insulin-like 


43 


193 


36 


. 9 


210 


24 


AAD45568 


Human insulin- like 


44 


193 


36 


.9 ' 


210 


24 


AAD44955 


Human insulin grow 


45 


193 


36 


.9 


210 


24 


ABA03146 


Native mature IGF- 



ALIGNMENTS 



RESULT 1 
AAD06400 

ID AAD06400 standard; cDNA; 523 BP. 
XX 

AC AAD064 00; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rabbit IGF- 1 isoform mechano -growth factor (MGF-) cDNA. 
XX 

KW Rabbit; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano-growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex- linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Oryctolagus cuniculus . 



XX 

FH Key Location/Qual if iers 

FT CDS 1 . .336 

FT /*tag= a 

FT /product= "Mechano-growth factor (MGF) " 

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 



PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2 000WO-GB04354 . 
XX 

PR 15-NOV-1999; 99GB-0026968 . 
XX 

PA (UNLO } UNIV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02449. 
XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 53-54; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF) , 

CC an Insulin- like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 2 0% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 



CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post -polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones , motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rabbit IGF-I isoform MGF cDNA. MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 

Query Match 100.0%; Score 523; DB 22; Length 523; 

Best Local Similarity 100.0%; Pred. No. 5.1e-144 ; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGAC CGGAGACG CTCTG CGGTG CTGAG CTGGTGGATG CT CTTCAGTT CGTGTGTGGAGA C 60 

I Mill III Ml II I III II I II II I Mill 1 1! II 1 : 1 1 1 ! I i 1 1 1 1 1 ! I M I 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTO^C^GCCC^^ 12 0 

MMMMMIIMMMMMIMIMMIMMMMMMIMMIMMMIMM 

Db 61 AGGGGCTTTTATTTCAAC^GCCCACAGGATACGGCTCCAGC^GTCGGAGGGC^CCTC^G 12 0 

Qy 121 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II II M I M 1 1 1 M I M II 1 1 1 II 1 1 M II I M II II II M M I II I II II I II II II II 

Db 121 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAGCTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

1 1 1 1 1 II li I M I II il . Ml 1 1 ! J 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 M iM li 1 1 

Db 181 TGTGC^CCCCTCAAGCCGGCAAAGGOVGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACC^^ 3 00 

I M I M II 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 II II I II II II II II I II M I II M I M II II II 

Db 241 ATGCCC^GACTCAGAAGTATC^GCCTCCATCTACC^C^GAAAATGAAGTCTCAGAGG 3 00 

Qy 301 AGAAGGAAAGGAAGTAC^TTTGAAGAACAC^GTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Mill IMIIIIIIIIMM Mill IIMM Mill MM IIMIIIIIIMIII Mill 

Db 301 AGAAGGAAAGGAAGTACATTTGAAGAACAC^GTAGAGGGAGTG(^GGAAACAAGAACTA 36 0 

Qy 3 61 CAGGATGTAGGAAGACC CTT CTGAGGAGTGAAGAAGGACAGG CCAC CGCAGGACCCTTTG 42 0 

I I M II 1 1 1 M M II 1 1 1 1 1 1 II II I II II 1 1 1 II 1 1 II 1 1 II I II I M II II II II 1 1 1 

Db 3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

Qy 421 CTCTG(^CAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 48 0 

M 1 1 1 MM III MMI I M Ml I M III 1 1 1 II 1 1 1 M M M I M M I M 1 1 MM 

Db 421 CTCTG C^CAGTTACCTGTAAACATTGGAATACCGGCCAA 48 0 

Qy 481 CAAAGATGGC^TTTCCCCC^TGAAATACACAAGTAAACATTC 523 

M 1 1 1 1 Ml Ml MMI M M MM I Ml 1 1 1 Ml 1 1 1 1 1 

Db 481 CAAAGATGGCATTTC CC CCAATGAAATACACAAGTAAACATT C 523 



RESULT 2 



AAS16879 

ID AAS16879 standard; cDNA ; 523 BP. 
XX 

AC AAS16879; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rabbit mechano -growth factor (MGF) cDNA. 
XX 

KW Rabbit; mechano -growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 

KW nerve avul s ion . 
XX 

OS Oryctolagus cuniculus. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..336 

FT /*tag= a 

FT /product = "Rabbit MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . . 76 

FT /*tag= b 

FT /number^ 3 

FT exon 77.-259 

FT /*tag= c 

FT /number= 4 

FT exon 260 . .309 

FT /*tag= d 

FT /number= 5 

FT exon 3 11.. 333 

FT /*tag= e 

FT /number^ 6 

XX 



PN WO200185781-A2 . 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2 00 1WO-GB02 054 . 
XX 

PR 10-MAY-2000; 2 00 0GB- 0 01 12 78 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10561 . 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 7; 65pp; English. 



XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano -growth factor (MGF) , in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration .' The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 2 0% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rabbit MGF. 

XX 

SQ Sequence 523 BP; 154 A; 129 C; 142 G; 98 T; 0 other; 

Query Match 100.0%; Score 523; DB 24; Length 523; 

Best Local Similarity 100.0%; Pred. No. 5.1e-144; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

II M I Ml 1 1 1 II MM 1 1 1 II 1 1 1 1 1 M 1 1 1 1 II 1 1 MMM M Ml 1 1 M I II I II 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC^Gt^GTCGGAGGGCACCTCAG 120 

II II I III II 1 11 II I M Ml M 1 1 MM! 1 1 1 1 II 1 1 1 1 1 1 1 IMIII I III 1 1 MM I 

Db 61 AGGGGCTTTTATTTCAACAAGCCC^CAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

1 1 1 M I Ml M I Ml 1 1 1 IMM M III I M II 1 1 II 1 1 II M 1 1 1 1 M I IMM 1 1 1 

Db . 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGOU^GGCA£^ 24 0 

1 1 II 1 1 1 II M I Ml 1 1 1 Ml 1 1 M IM 1 1 1 Ml I Ml I M 1 1 1 1 1 1 M I M II 1 1 1 1 

Db 181 TGTGCACCCCTCAAGCCGGO^GGCAGCCCGCTCCGTCCGTGCCOVGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCT^ 300 

M I M IMII 1 1 III 1 1 1 II IM 1 1 III I MM 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II I Ml 

Db 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCA^ 300 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

M I II MM II I III 1 1 1 II III II III I IMM I II 1 1 II 1 1 1 1 1 1 1 II IMM 1 1 1 

Db 301 AGAAGGAAAGGAAGTAC^TTTGAAGAACAC^GTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG C CACCG CAGGACCCTTTG 42 0 

II 1 1 1 1 II 1 1 1 1 Ml I IM II M 1 1 MM 1 1 IM 1 1 1 1 1 1 1 1 1 IMII I MM M MM 

Db 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCG CAGGACCCTTTG 42 0 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGAT(^C^TTT 48 0 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCG 48 0 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATAt^a^GTAAACATTC 523 

1 1 1 1 II I IM 1 1 Ml 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 II 1 1 II MM 

Db 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



RESULT 3 
AAT84 8 93 

ID AAT84893 standard; cDNA ; 553 BP. 
XX 

AC AAT84893; 
XX 

DT 14-APR-1998 (first entry) 
XX 

DE Rabbit insulin like growth factor 1 encoding cDNA . 
XX 

KW Insulin like growth factor 1; IGF-1; Ec peptide; muscle disorder; 

KW heart; neuromuscular disease; primer; ss. 

XX 

OS Oryctolagus cuniculus . 
XX 

FH Key Location/Qualif iers 

FT CDS 1. .366 

FT /*tag= a 

FT /product^ " IGF-1" 

XX. 

PN W09733997-A1 . 
XX 

PD 18-SEP-1997. 
XX 

PF ll-MAR-1997; 97WO-GB00658 . 
XX 

PR ll-MAR-1996; 96GB- 0005124 . 
XX 

PA (UNLO ) ROYAL FREE HOSPITAL SCHOOL MED. 
XX 

PI Goldspink G; 
XX 

DR WPI; 1997-470877/43. 

DR P-PSDB; AAW23301. 
XX 

PT Use of insulin like growth factor I characterised by presence of Ec 

PT peptide - to treat humans or animals, particularly muscle disorders, 

PT heart conditions or neuromuscular diseases 
XX 

PS Disclosure; Fig 3; 33pp; English. 
XX 

CC A use of insulin like growth factor I (IGF-1) has been developed, and 

CC is characterised by the presence of the Ec peptide, or a functional 

CC equivalent, in the treatment or therapy of a human or animal. The IGF-1 

CC polypeptide can be used to treat muscular disorders, e.g. Duchenne or 

CC Becker muscular dystrophy, autosomal dystrophies and related progressive 

CC skeletal muscle weakness and wasting, muscle atrophy in ageing humans, 

CC spinal cord injury induced muscle atrophy and neuromuscular diseases, 

CC and cardiac disorders, e.g. diseases where promotion of cardiac muscle 

CC protein synthesis is a beneficial treatment, cardiomyopathies and acute 

CC heart failure or insult, specifically myocarditis or myocardial 

CC infarction. It can also be used to promote bone fracture healing and 

CC maintenance of bone in old age. The present sequence encodes rabbit 

CC IGF-1 used in the present specification. 
XX 

SQ Sequence 553 BP; 159 A; 142 C; 147 G; 105 T; 0 Other; 



Query Match 100.0%; Score 523; DB 18; Length 553; 

Best Local Similarity 100.0%; Pred. No. 5.3e-144; 

. Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 1 1 1 m ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 ii ii 1 1 1 1 1 1 1 1 1 1 1 1 ri 1 1 ii ii 

Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

III Mllllllll IMMIIir MIIIIIM illlllllil IIIIMII 

Db 91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150 



Qy 121 ACAGG CAT CGTGGATGAGTGCTGCTTC CGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

1 1 1; ii i 1 1 1 i-M' ■! II 1 1 1 i 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I I I I I 

Db 151 ACAGG CATCGTGGATGAGTG CTG CTT CCGGAGCTGTGAT CTGAGGAGG CTGGAGATGTAC 210 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCT 24 0 

MM IMIIMIII IMMI Ml IMMIMM MIMMIM IMMMMI 

Db 211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270 

Qy 241 ATG C CCAAGACT CAGAAGTATCAG C CT CCATCTAC CAACAAGAAAATGAAGTCTCAGAGG 3 00 

1 1 MIIM I II M : II II 1 1 II 1 1 II II II 1 1 1 1 1 1 II 1 1 1 1 1 M Ml 1 1 1 1 1 

Db 271 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACA^ 330 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 3 60 

M I M 1 1 M 1 1 II MIM II I M I Ml M I M 1 1 1 1 M I II I M I M Ml 

Db 331 AGAAGGAAAGGAAGTACA.TTTGAAGAACACAAGTAGAGGGAGTGCAG 3 90 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

II: IIMIMM' IMIIIIII Illllllll MMIIMM IMIIIIIII 

Db 3 91 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 4 80 

MINI Mill MM II MINI IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 451 CTCTGC^CAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACA 510 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II MMIIMM Illllllll MMIIIIMIMIII 

Db 511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553 



RESULT 4 
AAD06398 

ID AAD06398 standard; cDNA; 517 BP. 
XX 

AC AAD063 98; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human IGF- I isoform mechano -growth factor (MGF) cDNA. 
XX 

KW Human; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano -growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post -polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 



KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss." 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..333 
FT /*tag= a 

FT /product^ " Me chano -growth factor (MGF) " 

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 



PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354 . 
XX 

PR 15-N0V- 1.999; 99GB-0026968 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02447. 
XX 

PT Use of mechano- growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 4 9-50; 66pp; English. 
XX 

CC The present invention relates to use of mechano -growth factor (MGF) , 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is human IGF-I isoform MGF cDNA. MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 

XX 

SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 



Query Match 89.4%; Score 467.4; DB 22; Length 517; 

Best Local Similarity 96.2%; Pred. No. 1.3e-127; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 



Qy 1 GGACCGGAGACG CTCTG CGGTG CTGAG CTGGTGGATG CTCTTCAGTT CGTGTGTGGAGAC 60 

I I I I II I I I I M I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GGACCGGAGACG CTCTGCGGGGCTGAG CTGGTGGATG CTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC 12 0 

1 1 1 1 1 1 1 i ii i ii i r i ii 1 1 1 m: ii M 1 1 1 1 1 1 1 1 ii 1 1 ii ii mill 

Db 61 AGGGGCTTTTATTTCAACAAGCCCAC^GGGTATGGCTCCAGC^GTCGGAGGGCGCCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

lllllll IMIIIII MIIIIMI MIMMIMM II llllllll II 

Db 121 ACAGG CATCGTGGATGAGTGCTG CTTC CGGAGCTGTGATCTAAGGAGG CTGGAGATGTAT 180 

Qy 181 TGTGCACCCCTC^GCCGGCAAAGGCAGCCCGCT^ 24 0 

II MIMMMMIM II III 1 1 1 1 Mill MMMIMMMMMMIMM 

Db 181 TGCGCACCCCTC^GCCTGCCAAGTC^GCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 240 

Qy 241 ATGCC CIAAGACTCAGAAGTATCAG CCT CCATCTACCAACAAG 3 00 

I IN 1 1 MM I I III Ml Mill M Ml II 1 II lllllll I MIIIIMI I 

Db 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACAAGAACACGA — G 2 97 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAA^ 3 60 

1 1 1 1 1 II 1 1 1 M 1 1 II 1 1 M 1 1 1 1 1 1 II II II II 1 1 M 1 1 II I II M 1 1 M II 1 1 1 1 M I 

Db 2 98 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGA 357 

Qy 361 CAGGATGTAGGAAGAC CCTTCTGAGGAGTGAAGAAGGACAGGC CAC CGCAGGAC CCTTTG 42 0 

MIIIIMI 1 1 1 II M II I II I M I II II II II 1 1 II II 1 1 1 1 II II I II 1 1 1 II II II 

Db 358 CAGGATGTA - GAAGAC CCTTCTGAGGAGTGAAGAAGGACAGG CCACCGCAGGAC CCTTTG 416 

Qy 421 CTCTG CACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATC^ 48 0 

MM IMIMM MMMM IMMIIMI MIMIII llllllll MIMM 

Db 417 CTCTGCACAGTTACCTGTAAAC^TTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 476 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521 

MM IIIIIMM IIIIIIM llllllll IIMM 

Db 477 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 



RESULT 5 
AAS16877 

ID AAS16877 standard; cDNA; 517 BP. 
XX 

AC AAS16877; 
XX 

DT 25-FEB-2002 {first entry) 
XX 

DE Human mechano -growth factor (MGF) cDNA. 
XX 

KW Human; mechano -growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 

KW nerve avulsion. 
XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..333 

FT /*tag= a 

FT /product = "Human MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . . 76 

FT /*tag= b 

FT /number = 3 

FT exon 77. .259 

FT /*tag= c 

FT /number^ 4 

FT exon 260. .3 07 

FT /*tag= d 

FT / number = 5 

FT exon 308 . .330 

FT /*tag= e 

FT /number= 6 

XX 

PN WO200185781-A2 . 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2 001WO-GB02 054 . 
XX 

PR 10-MAY-2000; 2 000GB- 00 11278 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10559. 
XX 

PT Use of insulin-like growth factor I (IGF-I) isoform known as 

PT mechano-growth factor which is encoded by IGF-I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Claim 11; Fig 5; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform, known as mechano-growth factor (MGF), in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 2 0% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the human MGF. 

XX 



SQ Sequence 517 BP; 150 A; 130 C; 139 G; 98 T; 0 other; 



Query Match 89.4%; Score 467.4; DB 24; Length 517; 

Best Local Similarity 96.2%; Pred. No. 1.3e-127; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

MIIIIIIM Mllllli Mill MM IMMMM MMMMIMM Mill 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

MIIIIIIIIIIIIMMIIMIIIIMI II IMMIMMMMIIMM III II I 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

IIIIIMIIIIIIIIIIIIIIIIIMIIMMIIIMIIII 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 180 



Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II II II III Ml III I II Ml MM Mill 

Db 181 TGCGCACCCCTC^AGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCA^ 3 00 

I i Mill! I MIIIIIIIMIII III II III III Mill I I MUM III I 

Db 241 ATGCCCAAGACCCAGAAGTATC^GCCCCCATC G 297 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

II I Mill Ml II I II I MM M II 1 1 1 1 1 Ml 1 1 II II 1 1 1 1 Ml IM Mill 

Db 2 98 AGAAGGAAAGGAAGTACATTTGAAGAAf^C^GTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

MIMIIII MMMIIIMMMIMMIIIMM I llllllllllllll I II I MM 

Db 3 58 CAGGATGTA-GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 48 0 

llllilll III III MIMIIII lllllll IIIIIMIIMMIMIMIIill illlll 

Db - 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 4 76 



Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521 

Mllllli I III I MM Mill M MM 

Db 4 77 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 517 



RESULT 6 
AAD06405 

ID AAD06405 standard; cDNA; 471 BP. 
XX 

AC AAD064 05; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Rabbit liver-type IGF-I isoform (L.IGF-I) cDNA. 
XX 

KW Rabbit; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano -growth factor; neurological disorder; neurodegenerative disorder; 

KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post -polio syndrome; toxin; motoneurone disorder; 



KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; liver; L.IGF-I; ss. 

XX 

OS Oryctolagus cuniculus . 
XX 

FH Key Location/Qual if iers 

FT CDS 1. -318 

FT /*tag= a 

FT /product^ "Liver-type IGF-I isoform (L.IGF-I)" 

FT /transl_except= (pos:7..9, aa:Gln) 

FT /transl_except= (pos:25..27, aa:Gln) 

FT /note= "These translation exceptions occur while decoding 

FT the alternative version of the protein (AAE02456) . 

FT The CDS comprises exons 3/4 and 6 and 

FT does not include start codon" 

FT /partial 

XX 

PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15 -NOV- 2000; 2 000WO-GB04354 . 
XX 

PR 15-NOV-1999; 99GB-0026968 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37. 

DR P-PSDB; AAE02452, AAE02456. 

XX 

PT Use of mechano -growth factor, an isoform of Insulin-like Growth 

PT Factor- I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Disclosure; Page 59-60; 66pp; English. 
XX 

CC The. present invention relates to use of mechano -growth factor (MGF) , 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 20% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post-polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rabbit liver-type IGF-I isoform (L.IGF-I) cDNA. 

CC The L.IGF-I protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4 and 6. 



XX 

SQ Sequence 471 BP; 132 A; 118 C; 131 G; 90 T; 0 other; 



Query Match 78.2%; Score 409; DB 22; Length 471; 

Best Local Similarity 90.1%; Pred. No. 2e-110; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

MMIIMM MMIMMMM. IMMMIMM IMIMMMIM Ml 

Db 1 GGACCGGAGACG CT CTGCGGTG CTGAGCTGGTGGATG CT CTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

MIIIMM MIMIMMIII MIMIMIMM MMIMMMM MM 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG' 120 

Qy 121 ACAGG CATCGTGGATGAGTGCTG CTT CCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I I I II I I I I I I I I II II I II I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MM Mill I II III IlllMIIII MM IlllMIIII I II II III II MM II II I II 

Db 181 TGTGOVCCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCC^CACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATC71GCCTCCATCTAC 300 

I I i M 1 1 1 1 1 1 i I 

Db 241 ATGCCCAAGACTCAG 255 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGG 36 0 

M MIMMMMM MMIMIMM IMMMMMM MIMI 

Db 256 AAGGAAGTA(^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

1 1 1 1 1 1 1 1 M 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

Db 309 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 48 0 

II II I I II I I II II I II II II II I M II I II I II I M I I I I I II I I I I I I I I I I II II I I 

Db 3 69 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 42 8 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

MM Mill III 1 1 ! 1 1 1 1 1 1 1 i 1 : 1 1 ! 1 1 1 i 1 1 III Mill 

Db 42 9 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471 



RESULT 7 
AAS16884 



ID AAS16884 standard; cDNA; 471 BP. 
XX 

AC AAS16884; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rabbit insulin-like growth factor I liver-type isoform (L.IGF-I) cDNA. 
XX 

KW Rabbit; mechano -growth factor; insulin-like growth factor I; IGF-I; MGF; 
KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 
KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 



KW nerve avulsion; insulin-like growth factor I liver-type isoform; L.IGF-I; 
XX 

OS Oryctolagus cuniculus . 
XX 

FH Key Locat ion/Qualif iers 

FT CDS 1- -318 

FT • /*tag= a 

FT /product= "Rabbit L.IGF-I" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . .75 

FT /*tag= b 

FT /number- exon 3 

FT exon 76 . .258 

FT /*tag= c 

FT /number^ exon 4 

FT exon 259. .315 

FT /*tag= d 

FT /number= exon 6 

XX 

PN WO200185781-A2 . 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2001WO-GB02054 . 
XX 

PR 10-MAY-2000; 2000GB-0011278 . 
XX 

PA (UNLO ) UNIV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10564 . 
XX 

PT Use of insulin- like growth factor I (IGF- I) isoform known as 

PT mechano -growth factor which is encoded by IGF- I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 10; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF-I) isoform,. known as mechano -growth factor (MGF) , in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 

CC methods can reduce motoneuron loss by 20% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rabbit insulin-like 

CC growth factor I liver-type isoform (L.IGF-I.) used in experiments on 



CC motoneuron loss. 
XX 

SQ Sequence 471 BP; 132 A; 118 C; 131 G; 90 T; 0 other; 



Query Match 78.2%; Score 409; DB 24; Length 471; 

Best Local Similarity 90.1%; Pred. No. 2e-110; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I . ^ I I I I II ! I I I II I I I III II I II II I I I I M II i M M 

Db 1 GGACCGGAGACGCTCTGCGGTGGTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAAC7UVGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II . 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

Qy 121 ACAGG CATCGTGGATGAGTGCTG CTT CCGGAG CTGTGAT CTGAGGAGGCTGGAGATGTAC 180 

I I I I I f I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 121 ACAGGCAT CGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

mi 1 1 1 1 1 1 1 M II 1 1 1 1 IM I IMI I II I II II I II III 1 1 1 1'! 1 1 1 III I 

Db 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTC^GAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

Illllllllllllll 

Db 241 ATGCCCAAGACTCAG 255 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTG 360 

IIIIIIIIIIIIMIIIIMIIIIIIMIIIMIIIIIIIIIIIIIMMIII 

Db 2 56 AAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 3 08 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

. I MM 1 1 MM II I II 1 1 1 IM MM 1 1 1 1 1 II M I Ml II I II I II I II Mill 1 1 1 

Db 3 09 CAGGATGTAGGAAGACCCTT CTGAGGAGTGAAGAAGGACAGGC CACCG CAGGAC CCTTTG 368 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

I M 1 1 II MM IMm 1 1 1 1 1 1 MM 1 1 1 1 Ml 1 1 1 MM II II I M I IMI I II II I 

Db 369 CTCTGCACAGTTACCTGTAAACATTGGAATACCG 428 

Qy 481 CAAAGATGG CATTT C CC CCAATGAAATACACAAGTAAACATT C 523 

II 1 1 1 MMM M MM 1 1 1 1 1 1 III 1 1 1 M MM 1 1 1 Ml I II 

Db 42 9 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471 



RESULT 8 
AAD06399 



ID AAD06399 standard; cDNA; 539 BP. 
XX 

AC AAD063 99; 
XX 

DT 10-AUG-2 001 (first entry) 
XX 

DE Rat IGF-I isoform mechano- growth factor (MGF) cDNA . 
XX 

KW Rat; IGF-I isoform; Insulin-like Growth Factor-I; MGF; 

KW mechano -growth factor; neurological disorder; neurodegenerative disorder; 



KW amyotrophic lateral sclerosis; spinal muscular atrophy; muscular atrophy; 

KW poliomyelitis; post-polio syndrome; toxin; motoneurone disorder; 

KW nerve damage; autosomal muscular dystrophy; diabetic neuropathy; 

KW sex-linked muscular dystrophy; peripheral neuropathy; 

KW Alzheimer's disease; Parkinson's disease; ss. 

XX 

OS Rattus sp. 
XX 

FH Key. Location/Qual if iers 

FT CDS 1..336 
FT /*tag= a 

FT /product^ "Mechano -growth factor {MGF)" 

FT /note= "This region comprises exons 3-6. The CDS does 

FT not include start codon" 

FT /partial 

XX 

PN WO200136483-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 15-NOV-2000; 2000WO-GB04354 . 
XX 

PR 15-NOV-1999; 9 9GB- 002 6968 . 
XX 

PA {UNLO ) UN IV COLLEGE LONDON . 
XX 

PI Goldspink G, Johnson I; 
XX 

DR WPI; 2001-355620/37'. 

DR P-PSDB; AAE02448. 
XX 

PT Use of mechano-growth factor, an isoform of Insulin-like Growth 

PT Factor-I, capable of reducing motoneurone loss, in the manufacture of a 

PT medicament for the treatment of neurological disorder - 

XX 

PS Claim 4; Page 51-52; 66pp; English. 
XX 

CC The present invention relates to use of mechano-growth factor (MGF) , 

CC an Insulin-like Growth Factor-I (IGF-I) isoform in the manufacture of a 

CC medicament for the treatment of neurological disorder. The MGF is capable 

CC of reducing motoneurone loss by 2 0% or greater in response to nerve 

CC avulsion, and effects motoneurone rescue, preferably adult motoneurone 

CC rescue. The MGF polynucleotide and polypeptide are useful in the 

CC manufacture of a medicament for the treatment of a neurological disorder, 

CC including a disorder of motoneurones and/or neurodegenerative disorder, 

CC e.g.., amyotrophic lateral sclerosis, spinal muscular atrophy, progressive 

CC spinal muscular atrophy, infantile or juvenile muscular atrophy, 

CC poliomyelitis or post -polio syndrome, a disorder caused by exposure to a 

CC toxin, motoneurone trauma, a motoneurone lesion or nerve damage, an 

CC injury that affects motoneurones, motoneurone loss associated with aging, 

CC autosomal or sex-linked muscular dystrophy, diabetic neuropathy, 

CC peripheral neuropathies, Alzheimer's disease and Parkinson's disease. 

CC The present sequence is rat IGF-I isoform MGF cDNA. MGF is a muscle 

CC isoform having extracellular (Ec) domain, hence also referred as 

CC IGF-I-Ec. The MGF protein comprises amino acid sequences encoded by 

CC nucleic acid sequence of IGF-I exons 4, 5 and 6 in the reading frame 

CC of MGF. 



XX 

SQ Sequence 539 BP; 161 A; 136 C; 139 G; 103 T; 0 other; 



Query Match 68.2%; Score 356.8; DB 22; Length 539; 

Best Local Similarity 82.3%; Pred. No. 5.1e-95; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II MMI MM MM Mill I 1 1 1 II II II I II II II I II II 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

I II I I II I I I I I I I I I II I I II I M I I I I II I I I I I I I II II I II II II II I I I 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 

Qy 121 ACAGG CATCGTGGATGAGTGCTG CTT CCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill Illllllllll Illllllllll : l illlllMlllllllllliMM 

Db 121 ACGGG CATTGTGGATGAGTGTTG CTT CCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTG(^CCCCTO^GCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MM I I llllll Mill MM II Ml MM llllllllllllll III 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCC^CACTGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACC 3 00 

lllllllll II llllll II Mill I III II lllllllll Mil II III 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACA 300 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGC^ 3 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIIMIIIIIIIIIMM MIMMIMIIMM III 

Db 301 AGAAGGAAAGGAAGTAC^CTTGAAGAACAC^GTAGAGGAAGTGCAGGAAAC^GACCTA 360 

Qy 3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

III llllllll II III I Mill I II I II I MMI II MMM 

Db 3 61 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 42 0 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II I llllll 1 1 1 1 III 1 1 1 1 .lllllllll II 

Db 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 480 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II Nil llllllllll IIIIIIIIIIIIIIIIIIIIIIIII 

Db 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATAC^CAAGTAAACATTC 53 0 



RESULT 9 
AAS16878 

ID AAS16878 standard; cDNA; 539 BP . 
XX 

AC AAS16878; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Rat mechano-growth factor (MGF) cDNA. 
XX 

KW Rat; mechano-growth factor; insulin-like growth factor I; IGF-I; MGF; 

KW neuroprotective; nerve damage; peripheral nervous system; nerve severing; 

KW muscle; neurological disorder; motoneuron loss; motorneuron disorder; ss; 



KW nerve avulsion. 
XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT CDS 1 . -336 

FT /*tag= a 

FT /product = "Rat MGF" 

FT /partial 

FT /note= "No start codon" 

FT exon 1 . ..75 

FT /*tag= b 

FT I number^ exon 3 

FT exon 76 . . 258 

FT /*tag= c 

FT /number= exon 4 

FT exon 259 . .309 

FT /*tag= d 

FT /number= exon 5 

FT exon 310.. 333 

FT V*tag= e 

FT /number = exon 6 

XX 

PN WO200185781-A2. 
XX 

PD 15-NOV-2001. 
XX 

PF 10-MAY-2001; 2 001WO-GB02 054 . 
XX, 

PR 10-MAY-2000; 2 000GB- 0 01 1278 . 
XX 

PA (UNLO ) UN IV COLLEGE LONDON. 

PA (EGRI-) EAST GRINSTEAD MEDICAL RES TRUST. 

XX 

PI Goldspink G, Terenghi G; 
XX 

DR WPI; 2002-055585/07. 

DR P-PSDB; AAU10560. 
XX 

PT Use of insulin-like growth factor I (IGF- I) isoform known as 

PT mechano-growth factor which is encoded by IGF- I exons 4,5,6 and has 

PT ability to reduce motoneuron loss in response to nerve avulsion, to 

PT treat nerve damage 

XX 

PS Disclosure; Fig 6; 65pp; English. 
XX 

CC The invention relates to the use of an insulin-like growth factor I 

CC (IGF- I) isoform, known as mechano-growth factor (MGF) , in the manufacture 

CC of a medicament for treating nerve damage in the peripheral nervous 

CC system, or for treating nerve damage by localising MGF at the site of 

CC damage. The nerve damage may include severing of a nerve. The treatment 

CC may be combined with another treatment (such as a polypeptide growth 

CC factor other than MGF) that prevents or diminishes degeneration of the 

CC target organ (for example, muscle) which the damaged nerve innervates, 

CC whereby the treatment of the muscle with MGF or a polynucleotide encoding 

CC MGF prevents or diminishes degeneration. The method is useful for 

CC treating neurological disorders, preferably motorneuron disorders. These 



CC methods can reduce motoneuron loss by 2 0% or greater in response to nerve 

CC avulsion. This sequence represents cDNA encoding the rat MGF . 

XX 

SQ Sequence 539 BP; 161 A; 136 C; 139 G; 103 T; 0 others- 
Query Match 68.2%; Score 356.8; DB 24; Length 539; 
Best Local Similarity 82.3%; Pred. No. 5.1e-95; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCGGAGACG CTCTG CGGTGCTGAG CTGGTGGATG CT CTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill M Mill M 1 1 1 1 1 1 1 M M I 1 1 II 1 1 M I II 1 1 1 1 1 1 1 1 1 1 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTATTTO^CAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

III 1 1 III I II IIMM II III II II I II MINIUM I Ml III 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCATTCGGAGGGCACCACAG 12 0 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II Mill 1 1 1 1 1 M 1 1 M 1 1 1 1 1 M M 1 1 M 1 1 It 1 1 M II 11 1 1 M 

Db 121 ACGGG CATTGTGGATGAGTGTTG CTT CCGGAGCTGTGAT CTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MM I I IIMM Mill MM II Ml MM 1 1 Ml 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Qy 241 ATGCCCAAGACT CAGAAGTATCAGCCTC CAT GTAC CAACAAGAAAATGAAGTCTCAGAGG 3 00 

1 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 Mill I III II 1 1 1 1 1 1 1 M MM II III 

Db 241 ATG CC CAAGACT CAGAAGTCCCAG C C CCTATCGACACACAAGAAAAGGAAGCTG CAAAGG 3 00 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAA^^ 36 0 

MMIMMMMIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 III 

Db 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGG 360 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

Ml 1 1 1 11 1 M II III I Mill I III II I lllllll II IIMM 

Db 3 61 CAGAATGTAGGAGGAG CCTCC CGAGGAACAGAAAATG CCACGTCACCGCAAGATC CTTTG 420 
Qy 421 CTCTG CACAGTTACCTGTAAACATTGGAATACCGG CCA AAAAATAAGTTTGATC 474 

M I IIMM MM III MM II III MM II 

Db 421 CTGCTTGAGC^CCTGOW^CATCGGAACACCTGCa^TATCAATAATGAGTTCAATA 48 0 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

lllllll MM MM MINI I I ! 1 

Db 481 T(^TTT(^GAGATGGGCATTTCCCT(^TGAAATACACAAGTAAACATTC 53 0 



RESULT 10 
ABV76185 

ID ABV76185 standard; cDNA; 651 BP. 
XX 

AC ABV76185; 
XX 

DT 07-MAR-2003 (first entry) 
XX 

DE Mouse insulin-like growth factor IB cDNA. 
XX 

KW Insulin-like growth factor IB; IGF-IB; mouse; mRNA; assay; 



KW nucleic acid detection; gene; ss. 
XX 

OS Mus musculus . 
XX 

FH Key Location/Qualifiers 

FT CDS 73 . . 474 

FT /*tag= a 

FT /product^ "IGF-IB" 

XX 

PN WO200297390-A2 . 
XX 

PD 05-DEC-2002. 
XX 

PF 31-MAY-2002; 2002WO-SE01056 . 
XX 

PR 01-JUN-2001; 2001SE- 0001934 . 
XX 

PA (BIOV-) BIOVITRUM AB . 
XX 

PI Parrow V, Rosengren L; 
XX 

DR WPI; 2003-129529/12. 
XX 

PT Quant itating a target nucleic acid in a sample comprises immobilizing, 

PT on a solid support, a sample comprising a target nucleic acid, and 

PT detecting and quant i tat ing , signals generated from the antisense and 

PT sense probes - 
XX 

PS Example 1; Page 16-17; 18pp; English. 
XX 

CC The present sequence is that of cDNA encoding murine insulin- like 

CC growth factor IB (IGF-IB) . The cDNA was used in an example of the 

CC method of the invention to generate probes for determination of 

CC IGF-IB RNA . The method comprises a quantitative hybridisation 

CC assay for analysis of mRNA in a target nucleic acid- (TNA) sample. 

CC It involves: (i) immobilising the TNA sample on a solid support; 

CC (ii) contacting a labelled antisense probe to a first portion of the 

CC TNA, and a labelled sense probe to a second portion of the TNA; 

CC (iii) detecting and quantitating the signals generated from the 

CC hybridised probes; and (iv) determining the value represented by 

CC the antisense probe signal minus the sense probe signal, the value 

CC being proportional to the amount of mRNA in the TNA sample. In an 

CC example of the method, a cDNA clone containing 60 nucleotides from 

CC exon 2 and 179 nucleotides from exon 3 of the mouse IGF-IB gene was 

CC cloned into pGEN-4Z vector. Linearisation of the plasmid with 

CC EcoRI allowed transcription of a 250-nucleotide antisense probe 

CC using T7 polymerase. Linearisation with Hindi I I allowed 

CC transcription of a sense probe of similar length using SP6 

CC polymerase (see ABV76186) . The probes were purified and used to 

CC determine IGF-I RNA in mouse hepatocytes and also in rat hepatocytes . 

XX 

SQ Sequence 651 BP; 193 A; 185 C; 149 G; 124 T; 0 other; 



Query Match 66.8%; Score 349.4; DB 25; Length 651; 

Best Local Similarity 82.8%; Pred. No. 8.3e-93; 

Matches 425; Conservative 0; Mismatches 81; Indels 7; Gaps 2; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 6 0 

IIIII IIIII II IIIII MIMMIIIIIIIIIIIIIIMIIIIIIIIIIMI 

Db 139 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

IIIIIIMIII llllllllll II II llllllllll llllllllllllllll 

Db 19 9 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Illlllll Illllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II I MINI 1 1 III 

Db 25 9 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 318 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGGCACACCGAC 24 0 

IIIII II II IIIII MM Illllllllll MM Mill III Ml MM III 

Db 319 TGTGGCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 378 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

II 1 1 1 II! ll I II Ml 1 1 II I I III II MINIMI MM IIIII 

Db 37 9 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 43 8 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

Illllllllll I II III II I II MIMIMMMIMM M 1 1 1 1 1 E 1 1 1 1 1 1 ! I III 

Db 43 9 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTG CAGGAAACAAGAC CTA 4 98 

Qy 361 CAGGATGTAGGAAGACC CTT CTGAGGAGTGAAGAAGGACAGGCCACCG CAGGAC C CTTTG 42 0 

Ml IIIIMM II III I MM I II I II IMMIIMI llllll 

Db 499 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 558 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA - -AAAAATAAGTTTGATC 474 

II I MINI I II III III II MINI II 

Db 55 9 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAATAACAATAATAAGTCCAATA 618 

Qy 4 75 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAA 5 06 

IIIII Mill II III I II III I II MM II I 

Db 619 ACATTACAAAGATGGGCATTTCGCCCAATGAAA 651 



RESULT 11 
AAN70436 

ID AAN70436 standard; cDNA; 818 BP. 
XX 

AC AAN70436; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-APR-1991 (first entry) 

XX 

DE Sequence encoding insulin- like growth factor 1A (IGF-1A) . 
XX 

KW Growth promoter; lactation enhancer; cell proliferation; ss. 
XX 

OS Homo sapiens . 
XX 

PN EP229750-A. 
XX 

PD 22-JUL-1987. 
XX 

PF 06-JAN-1987; 87EP- 087 000 1 . 



XX 

PR 20-NOV-1986; 86US-0929671 . 

PR 07-JAN-1986; 86US-0816662 
XX 

PA (UNIW ) UNIV WASHINGTON. 
XX 

PI Krivi GG, Rotwein PS; 
XX 

DR WPI; 1987-200203/29. 
XX 

PT New pre-pro- insulin- like growth factor- 1 protein - obtd. by 

PT recombinant DNA procedures for use as growth promoters for 

PT enhancing lactation, for stimulating cell proliferation etc. 
XX 

PS Example; Fig 5; 59pp; English. 
XX 

CC A 42 base oligonucleotide corresponding to the DNA sequence encoding 
CC . amino acids 10 to 23 of mature human IGF-I was synthesized (AAN70437) . 

CC The radiolabeled 42 mer was then employed to screen for IGF-I 

CC containing DNA sequences in a human liver cDNA library. Insulin- 

CC like growth factors -1A and -IB cDNAs were isolated from a human cDNA 

CC 'library by using lambdagt 11 (AAN70435, AAN70436) . The human IGF-1 

CC genomic gene was isolated and mapped. It encodes at least two 

CC preproinsulin-like growth factor-1 proteins. An essentially pure 

CC proproinsulin-like growth factor-1 protein comprising the sequence 

CC of amino acids shown in Figure six is claimed (AAP70277) . 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 818 BP; 232 A; 186 C; 187 G; 213 T; 0 other; 

Query Match 63.9%; Score 334.4; DB 8; Length 818; 

Best Local Similarity 84.6%; Pred. No. 2.4e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I II I I I II I II I M IN II Mill I I II III IN III I I III llllllllllllll 

Db 2 03 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 2 62 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

I IIIIIIIIIIIIIIMII Mill II lilllllll , MINIM llllll 

Db 263 AGGGGCTTTTATTTCAACAAGCCC^CAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 322 

Qy 121 ACAGG CAT CGTGGATGAGTGCTG CTTC CGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

III MINI llllllllllllll IIIIIIIIIIIIIIMII MMMMIIMIMM 
Db 323 ACAGG CAT CGTGGATGAGTGCTG CTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 382 

Qy 181 TGTGCACCCCTCAAGCCGGC^WWXC^^ 24 0 

II llllllllllllll II III MM Mill i 

Db 383' TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 442 

Qy 241 ATG CCCAAGACTCAGAAGTAT CAG C CTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

I II llllll II III 

Db 443 ATGCCCAAGACCCAG 4 57 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACAC^AGTAGAGGGAGTGCAGGAAACAAGAACTA 3 60 

II II MM I Mill 1 1 Mill llllllllllllll MIMMMMMMM 

Db 458 AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTA 510 



Qy 

Db 

Qy 
Db 

Qy 
Db 



361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

i 1 1 1 1 ! 1 1 1 ! 1 1 1 1 1 1 1 IIIIIIIMIIIII II 1 1 ill: : I llllll 

511 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 570 
421 CTCTGCAC-AGTTACCTG-TAAAC^TTGGAATACCGGCCAAAAAATAAGTTTGATa^CAT 478 

llllll II II I II II 1 1 Mill llllll III IIIIIIIMIIIII MM MM 

571 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 63 0 
479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II llllll III 1 1 ! 1 1 Ml 1 1 1 1 1 ! 1 1 1 II I M 1 1 1 1 II 

631 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 676 



ABT11091; 

04-DEC-2002 (first entry) 

Human breast cancer associated coding sequence SEQ ID NO: 1225. 

Human; breast specific gene; breast cancer; differential expression; 
cytostatic; gene therapy; gene; ss. 

Homo sapiens . 

WO200259271-A2. 



01-AUG-2002. 



25-JAN-2002; 2002WO-US02 176 . 



RESULT 12 
ABT11091 

ID ABT11091 Standard; CDNA; 7260 BP. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
XX 
PA 
XX 
PI. 
XX 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 



25-JAN-2001; 
25-APR-2001; 
23-MAY-2001; 



2001US-263757P. 
2001US-286090P. 
2001US-292517P. 



(GENE-) GENE LOGIC INC . 

Orr MS, Nation M, Diggans JC, Zeng W; 
WPI; 2002-674803/72. 

Diagnosing breast cancer in a patient comprises detecting the level of 
gene expression in cell or tissue samples, where a differential gene 
expression is indicative of breast cancer 

Claim 1; SEQ ID NO 1225; 260pp + Sequence Listing; English. 

The present invention relates to methods of diagnosing breast cancer in a 
patient, which comprise detecting the level of expression in a tissue 
sample of two or more genes selected from those shown in ABT09867- 
ABT11112, where a differential expression of the genes indicates breast 
cancer. The methods are useful in diagnosing, treating, detecting the 
progression, and in monitoring treatment of breast cancer in patients. 



CC The methods are also useful as a screening tool for agents that modulate 

CC the onset or progression of breast cancer. The breast cancer genes may be 

CC used as diagnostic markers for the prediction or identification of the 

CC malignant state of breast tissue, for confirming the type and progression 

CC of cancer, and for drug screening and assays. The present sequence is a 

CC coding sequence of the invention. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub.published_pct_sequences. 
XX 

SQ Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 

Query Match 63.9%; Score 334.4; DB 24; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I I I I I I! I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGA 12 0 

1 1 1 1 M I M 1 1 1 1 1 1 M I Ml 1 1 1 II: II i 1 1 1 1 h 1 1 1 1 1 1 1 Mill 

Db 371 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCC^ 43 0 

Qy 121 ACAGG CAT CGTGGATGAGTG CTG CTTCCGGAGCTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

I I I I III I I I I I I I I I I I I I I M I II I ! I I I I ' I II M I I I I I I I I I I I I i I 

Db 431 ACAGG CAT CGTGGATGAGTG CTG CTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATG TAT 490 

Qy 181 TGTGC^CCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCQVCACCGAC 24 0 

II I III MINIM II III 1 1 1 1 Mill M i M ! 1 1 1 1 1 M 1 1 1 1 1 1 1 II 

Db 4 91 TGCGCACCCCTCAAGCCTGCC^GT(^GCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 55 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAA^ 300 

1 1 1 1 1 M I ; 1 1 in 

Db 551 ATGCCCAAGACCCAG 565 

Qy 3 01 AGAAGGAAAGGAAGTA^TTTGAAGAACAC^GTAGAGGGAGTGC^GGAAAO^GAACTA 36 0 

I II I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 566 AAGGAAGTACATTTGAAGAACGOVAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGGATGTAGGAAGACCCTT CTGAGGAGTGAAGAAGGACAGG CCACCG CAGGACCCTTTG 42 0 

I I I I I I I I I I I I ! I I I I IMIIIIMIMM I I I I I I I I I I I I I I I I I I I I I I 

Db 619. CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 CTCTGCAC -AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

II llllll MINIMI Mill MINI III Mill IMIIMI Mill MM 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACC^AAAAATAAGTTTGATAACAT 738 



Qy 4 79 TTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

M llllll III MMMMMMIMM I lllllll 

Db 73 9 TTAAAAGATGGGCGTTTCCCCCAA.TGAAATACACAAGTAAACATTC 784 



RESULT 13 
ABK84583 

ID ABK84583 standard; cDNA ; 7260 BP. 
XX 



AC ABK84583; 
XX 

DT 14-AUG-2002 (first entry) 
XX 

DE Human cDNA differentially expressed in granulocytic cells #1154. 
XX 

KW Human; ss; granulocytic cell; DNA chip; bacterial infection; 

KW viral infection; parasitic infection; protozoal infection; 

KW fungal infection; sterile inflammatory disease; psoriasis; 

KW rheumatoid arthritis; glomerulonephritis; asthma; thrombosis; 

KW cardiac reper fusion injury; renal reperfusion injury; ARDS; 

KW adult respiratory distress syndrome; inflammatory bowel disease ; 

KW Crohn's disease; ulcerative colitis; periodontal disease; 

KW granulocyte activation; chronic inflammation; allergy. 

XX 

OS Homo sapiens. 
XX 

PN WO200228999-A2 . 
XX 

PD ll-APR-2002. 
XX 

PF 03-OCT-2001; 2001WO-US30821 . 
XX 

PR 03-OCT-2000; 2 00 0US-237 18 9P . 
XX 

PA (GENE-) GENE LOGIC INC. 
XX 

PI Beazer-Barclay Y, Weissman SM, Yamaga S, Vockley J; 
XX 

DR WPI; 2002-435328/46. 
XX 

PT Detecting granulocyte activation by detecting differential expression 

PT of genes associated with granulocyte activation, which serves as 

PT diagnostic markers that is useful for monitoring disease states and 

PT drug toxicity - 
XX 

PS Claim 1; SEQ ID No 1154; 114pp; English. 
XX 

CC The invention relates to detecting (Ml) granulocyte (GC) activation 

CC (GCA) , by detecting the level of expression of gene(s) (Gs) identified by 

CC DNA chip analysis as given in the specification, and comparing 

CC the expression level to an expression level in an unactivated 

CC GC, where differential expression of Gs is indicative of GCA. 

CC Also included are modulating (M2) GA by contacting GC with an agent 

CC that alters the expression of at least one gene in Gs; (2) screening (M3) 

CC for an agent capable of modulating GCA or an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease using the 

CC gene expression profile; (3) detecting (M4) an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease, by detecting the 

CC level of expression in a sample of the tissue of gene(s) from Gs, where 

CC the level of expression of the gene is indicative of inflammation; 

CC (4) treating (M5) an inflammation (especially chronic) or in a tissue, 

CC an allergic response in a subject, exposure of a subject to a pathogen 

CC or sterile inflammatory disease, by contacting a tissue having 

CC inflammation with an agent that modulates the expression of gene(s) 



from Gs in the tissue. Ml is useful for detecting GCA; M2 is useful for 
modulating GA; M3 is useful for screening an agent capable of modulating 
GCA preferably in an inflammation in a tissue; M4 is useful for 
detecting an inflammation (especially chronic) in a tissue, an allergic 
response in a subject, exposure of a subject to a pathogen or sterile 
inflammatory disease (e.g. psoriasis, rheumatoid arthritis, 
glomerulonephritis, asthma, thrombosis, cardiac reperfusion injury, renal 
reperfusion injury, ARDS, adult respiratory distress syndrome, 
inflammatory bowel disease, Crohn's disease, ulcerative colitis, 
periodontal disease,- also bacterial infection, viral infection, 
parasitic infection, protozoal infection, fungal infection and M5 is 
useful for treating one of the above conditions. The present 
sequence represents a gene differentially expressed in granulocytes. 
Note: The sequence data for this patent did not form part 
of the printed specification, but was obtained in electronic 
format directly from WIPO at 
ftp.wipo. int/pub/published_pct_sequences . 

Sequence 7260 BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 

Query Match 63.9%; Score 334.4; DB 24; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

i [ 1 1 ! 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1! 1 1! 1 1 1 1 1 1 1 1 1 1 1 I i I 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

1 II IIIMIIIIIIIIIIIMII llllll 

Db 371 AGGGGCTTTTATTTC^CAAGCCCACAGGGTATGGCTCCAGGAGTCGGAGGGCGCCTCAG 43 0 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

III Ml 1 1 1 1 II III I i I MM llllll MM! llllll IIIIIIIIIIIIIIII I 

Db 431 ACAGG CATCGTGGATGAGTGCTG CTTCCGGAGCTGTGATCTAAGGAGG CTGGAGATGTAT 4 90 

Qy 181 TGTGCA.CCCCTC^VGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II llllllllllllll II III 1 1 1 1 Mill lllllllll I ! 1 1 1 1 1 1 1 I 

Db 4 91 TGCGCA.CCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 55 0 

Qy 241 ATGCCC^GACTCAGAAGTATCAGCCTCC^TCTAC(^CAAGAAAATGAAGTCTCAGAGG 3 00 

IIIIMMI III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACA.CAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

i i i s ri i i i i i u i mm mi i i in 

Db 566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

1 1 II I II 1 1 1 1 1 M 1 1 1 llllllllllllll 1 1 1 1 IIIIIIIIIIII llllll 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 CTCTGCAC-AGTTACCTG-TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT .478 

IIIMIII lllllllll Mill llllll III II II II I M I II 1 1 1 1 1 1 1 1 II 

Db 67 9 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 73 8 

Qy 47 9 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



II Illlll III I II I III 1 1 1 MM 1 1 1 1 II I II II 

Db 73 9 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 



RESULT 14 
ABN97244 

ID ABN97244 standard; DNA; 7260 BP. 
XX 

AC ABN97244; 
XX 

DT 13-AUG-2002 (first entry) 
XX 

DE Gene #3742 used to diagnose liver cancer. 
XX 

KW Gene; liver cancer; ds; hepatocellular carcinoma; hepatotropic; 

KW metastatic liver tumour; cytostatic; expression profile; disease state; 

KW disease progression; drug toxicity; drug efficacy; drug metabolism. 

XX 

OS Homo sapiens. 
XX 

PN WO200229103-A2 . 
XX 

PD ll-APR-2002. 
XX 

PF 02-OCT-2001; 2001WO-US3 0589 . 
XX 

PR 02-OCT-2000; 2000US-237054P . 
XX 

PA (GENE- ) GENE LOGIC INC. 
XX 

PI Home D, Alvares C, Peres-Da-Silva S, Vockley JG; 
XX 

DR WPI; 2002-426119/45. 
XX 

PT Diagnosing and detecting the progression of liver cancer, 

PT hepatocellular carcinoma or metastatic liver tumor in a patient, 

PT involves detecting the level of expression of two or more genes in a 

PT liver tissue sample 

XX 

PS Claim 1; SEQ ID NO 3742; 298pp ; English. 
XX 

CC The invention relates to a novel method for diagnosing and detecting the 

CC progression of liver cancer, hepatocellular carcinoma or metastatic liver 

CC tumour in a patient, and differentiating metastatic liver cancer from 

CC hepatocellular carcinoma in a patient, involving detecting the level of 

CC expression of two or more genes represented in ABN93503 -ABN974 55 in a 

CC tissue sample. The method of the invention has hepatotropic, and 

CC cytostatic activity. The method is useful for diagnosing and detecting 

CC the progression of liver cancer, hepatocellular carcinoma and metastatic 

CC liver carcinoma in a patient. The method is useful for identifying 

CC expression profiles which serve as useful diagnostic markers as well as 

CC markers that can be used to monitor disease states, disease progression, 

CC drug toxicity, drug efficacy and drug metabolism. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published__pct_sequences. 
XX 



SQ Sequence 7260. BP; 2330 A; 1415 C; 1240 G; 2275 T; 0 other; 



Query Match 63.9%; Score 334.4; DB 24; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 1 1 1 1 ill 1 1 1 Ml i I in Mil 1 1 II 1 1 II I M 1 1 1 1 1 1 li 1 1 1 II h 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

1 1 1 1 M J 1 1 1 M 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 M M 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 MINI 

Db 3 71 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

1 1 1 1 II! M I II I II I II M I M 1 1 1! 1 1 IMM I 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 

Db 431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 4 90 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II IMIIIIIIMIII II Ml 1 1 1 1 IMM MMMMMMMIMMIMM 

Db 4 91 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

1 1 1 1 1 1 1 1 1 1 1 Ml 

Db 551 ATGCCCAAGACCCAG 565 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 3 60 

! M 1 1 1 1 1 1 1 1 1 ! 1 1 i 1 1 Ml I IMM 1 1 IMM I M II 1 1 IM 

Db 566. AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTA 618 

Qy 3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG CCACCG CAGGACC CTTTG 420 

1 1 1 1 1 ; 1 1 1 ! ; 1 1 1 i i I I IIIIIIIIIIIIM I II I 1 1 1 1 II I II 1 1 I 1 1 II II 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

! 1 1 i 1 1 1 1 IIIMIMI Mill MM III I M 1 1 1 1 1 1 M 1 1 II M I 1 1 1 1 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738 

Qy 479 TTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II MUM Ml I : M II 1 1 II MM Mill 1 1 1 M 1 1 

Db 739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 

RESULT 15 
ABK64812 

ID ABK64812 standard; DNA; 7260 BP. 
XX 

AC ABK64812; 
XX 

DT 18-JUN-2002 (first entry) 
XX 

DE Human benign prostatic hyperplasia gene #707. 
XX 

KW Human; benign prostatic hyperplasia; BPH; prostate cancer; gene; ds . 
XX 

OS Homo sapiens. 
XX 



PN WO200212440-A2 . 
XX 

PD 14-FEB-2002. 
XX 

PF 07-AUG-2001; 2001WO-US24708 . 
XX 

PR 07-AUG-2000; 2000US-223323 P . 

PR 05-JUN-2001; 2001US-0873319 . 
XX 

PA (GENE- ) GENE LOGIC INC. 

PA (NISB ) JAPAN TOBACCO INC. 
XX 

PI Munger WE, Kulkarni P, Getzenberg RH, Waga I, Yamamoto J; 
XX 

DR WPI; 2002-257476/30. 
XX 

PT Identifying drugs for and diagnosing benign prostatic hyperplasia, by 

PT detecting expression levels of one or more genes in prostate cells from 

PT patient that are differentially regulated compared to normal prostate 

PT cells 
XX 

PS Disclosure; Page 391-393; 444pp ; English. 
XX 

CC The invention relates to a method of diagnosing (I) the onset or 

CC progression of benign prostatic hyperplasia (BPH) , or screening (II) for 

CC or identifying an agent that modulates the onset or progression of BPH. 

CC The method is based on changes in gene expression in BPH tissue isolated 

CC from patients exhibiting different clinical states of prostate 

CC hyperplasia as compared to normal prostate tissue. (I) comprises 

CC detecting the expression levels of one or more genes in prostate cells 

CC from the subject that are differentially regulated compared to normal 

CC prostate cells. (II) comprises preparing a first gene expression profile 

CC of BPH cells or BPH-like cell population, exposing the cells to the 

CC agent, preparing a second gene expression profile of the agent exposed 

CC cells, and comparing the first and second gene expression profiles. 

CC (I) is useful for diagnosing the onset or progression of BPH. (II) is 

CC useful for identifying an agent that modulates the onset or progression 

CC of BPH. The methods are useful to present information identifying 

CC the expression level in a tissue or cells, by comparing the expression 

CC level of genes given in the specification in the tissue or cells to the 

CC level of expression of gene in the database, and displaying the 

CC expression levels of at least one gene in the tissue or cell sample 

CC compared to the expression level in BPH. Agents using (II) are useful for 

CC treating BPH or prostate cancer. ABK64106-ABK64860 represent human 

CC benign prostatic hyperplasia gene sequences of the invention. 

XX 

SQ Sequence 7260 BP; 2330 A; 1415 C;. 1240 G; 2275 T; 0 other; 

Query Match 63.9%; Score 334.4; DB 24; Length 7260; 
Best Local Similarity 84.6%; Pred. No. 5.7e-88; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIMIIIIIMIIIIIIII IIIIIIIIIUIIMIIIMIMIIIIIIIIIIIIIIII 

Db 311 GGAC CGGAGACG CT CTG CGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 



Qy 



61 AGGGGCTTTTATTTC^CAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 



i 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II llllllllllllllllllll IMIII 

Db 371 AGGGGCTTTTATTTCA^C^GCCt^C^GGGTATGGCTCCTVGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGG CATCGTGGATGAGTGCTGCTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 18 0 

I I li II I II I II I I I I IN I I I I I II I I I I I I I I I I IMIII Mill IMIII 

Db 431 ACAGG CATCGTGGATGAGTGCTGCTTCCGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 4 90 

Qy 181 TGTGC^CCCCTC^GCCGGC^WVGGC^ 24 0 

II I MM 1 1 MM II I M Ml MM Mill 1 1 1 1 i I ! 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4 91 TGCGCACCCCTCAAGCCTGCC^GTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATG C C CAAGACTCAGAAGTATCAG CCTCCATCTACCAACAAGAA 300 

1 1 1 1 1 M ) 11 1 III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 3 01 AGAAGGAAAGGAAGTA(^TTTGAAGAA(^a^GTAGAGGGAGTGCAGGAAACAAGAACTA 360 

1 1 Ml 1 1 1 1 1 M 1 1 1 1 1 II I IIIIIIIIIIMMIIIIIIIIIIIIIIIII 

Db 566 AAGGAAGTACATTTGAAGAACG(^GTAGAGGGAGTGO\GGAAACAAGAACTA 618 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

IIIIIIIIIIIMIIMM MM MM 1 1 III I 1 1 M I II II I II M 1 1 II I II I 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATG CCACCG CAGGAT CCTTTG 678 

Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

llllllll lllllllll Mill IMIII Ml IIIMIIMIIIIIIIII MM 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738 

Qy 479 TTCAAAGAT - GGCATTTGCCCCAATGAAATACACAAGTAAACATTC 523 

II 1 1 1 1 1 III I M 1 1 1 1 1 1 II II M II 1 1 1 1 1 1 1 M I II 1 1 II 

Db 73 9 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 
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Title: 

Perfect score: 
Sequence : 



US-09-852-261-5 
523 

1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523 



Scoring table: 



I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



569978 seqs, 220691566 residues 



Total number of hits satisfying chosen parameters: 



1139956 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing : Minimum Match 0% 
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RESULT 1 

US-09-142-583A-3 

; Sequence 3, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION: 

APPLICANT: GOLDSPINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 



OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE : Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/142 , 583A 

FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA : 

APPLICATION NUMBER: WO PCT/GB97/00658 

FILING DATE: ll-MAR-1997 

APPLICATION NUMBER: GB 9605124.8 

FILING DATE: ll-MAR-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: SADOFF , B. J. 

REGISTRATION NUMBER: 36663 

REFERENCE/DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7038164 000 

TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 553 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : both 

TOPOLOGY: linear 
MOLECULE TYPE : cDNA 
FEATURE : 

NAME /KEY: CDS 

LOCATION: 1..363 
SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
US-09-142-583A-3 

Query Match 100.0%; Score 523; DB 3; Length 553; 

Best Local Similarity 100.0%; Pred. No. 1.6e-154; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 6 0 

II) MIIMMIIIIMIII ' I IMM IIIIIIII IM MMMMIMM 

Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCT^ 120 

MM MIMIIIMIIIIIIIM MMMMIMIMM MIIIIIIIMM 

Db 91 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 150 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

1 1 1 1 M 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 ! I IIIIIIII MINI Mill IIIIIIIIIIIMI 

Db 151 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 210 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MIMMMMMMMMIIMM II IIIIIIII llllll Mill MMIMMIMM 

Db 211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

II M II 1 1 1 II II 1 1 Ml I M I I II II I I Mill I M; II II Ml II I IM 

Db 271 ATGCCCAAGACTCAGAAGTATCAGCCTCC^TCTACC^CAAGAAAATGAAGTCTC^GAGG 330 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTCC^ 360 

Ml M M I II III 1 1 Mill I II II IIIIIIII IMMI Mill III III II 1 1 III I 



Db 



331 AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAAO^AGAACTA 3 90 



Qy 3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG C CACCGCAGGACCCTTTG 42 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

Db 3 91 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 450 

Qy 421 CTCTGCA.CAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 48 0 

IMIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIMIIIIIIIMM 

Db 4 51 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 510 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

MIMIMIIIII I MINN lllllilllll IMIIIIII 

Db 511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553 



RESULT 2 

US-09-142-583A-5 

; Sequence 5, Application US/09142583A 
; Patent No. 6221842 

GENERAL INFORMATION: 

APPLICANT: GOLDSPINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE : Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 142 , 583A 

FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
. PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/00658 

FILING DATE: ll-MAR-1997 

APPLICATION NUMBER: GB 9605124.8 

FILING DATE: ll-MAR-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: SADOFF, B. J. 

REGISTRATION NUMBER: 36663 

REFERENCE / DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7 038164 000 

TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH:. 553 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : both 

TOPOLOGY: linear 



MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: CDS 

LOCATION: 341.. 397 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-142-583A-5 



Query Match 100.0%; Score 523; DB 3; Length 553; 

Best Local Similarity 100.0%; Pred. No. 1.6e-154; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 6 0 

1 1 M III II MIM 1 1 IMIMI I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill I MM 

Db 31 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 90 



Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

1 1 1 1 II 1 1 1 1 M I II I II II 1 1 M iM II 1 1 I M II II I M Ml I II 1 1 M 1 1 M 

Db 91 AGGGGCTTTTATTTC^C^GCCCAC^GGATACGGCTCC^GCAGTCGGAGGGCACCTCAG 150 



Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

1 1 M Ml IM MM 1 1 IMIM 1 1 II 1 1 1 1 1 I I II 1 1 1 1 1 1 1 1 1 1 1 II I M II 

Db 151 ACAGG CAT CGTGGATGAGTGCTG CTTCCGGAGCTGTGAT CTGAGGAGGCTGGAGATGTAC 210 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCC^GCGCC^CACCGAC 24 0 

M 1 1 II 1 1 1 1 M I II I M III 1 1 IMIII 1 1 M : : II 1 1 1 Mill I II 1 1 II 1 1 M 

Db 211 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 270 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAG^ 3 00 

II 1 1 MM 1 1 1 1 Ml Mill I II I II I II I III M ' M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 271 ATG CCCAAGACTCAGAAGTATCAG C CT C CATCTACCAACAAGAAAATGAAGTCT CAGAGG 33 0 



Qy 3 01 AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

1 1 II 1 1 M II 1 1 1 1 II III 1 1 1 II M I II II M 1 1 III 1 1 Ml M I M II 

Db 331 AGAAGGAAAGGAAGTAC^TTTGAAGAACAC^GTAGAGGGAGTGCAGGAAACAAGAACTA 3 90 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

M II II 1 1 MIM 1 1 II Ml I II II I II Ml II M I II 1 1 1 1 1 1 M III MM 

Db 391 (^GGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 4 50 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAA 480 

IIIIIIIIIIMIIIIIIIIMIMMIIIMIIIIIIIIIIIIIIMIIIIIMMIM 

Db 4 51 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTT 510 

Qy 4 81 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

I 1 1 M 1 1 1 1 II 1 1 1 1 II III 1 1 MM 1 1 1 II M 1 1 1 M 

Db 511 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 553 



RESULT 3 

US-09-142-583A-10 

Sequence 10, Application US/09142583A 
Patent No. 6221842 
GENERAL INFORMATION: 

APPLICANT: GOLDS PINK, GEOFFREY 

TITLE OF INVENTION: METHOD OF TREATING MUSCULAR DISORDERS 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 



ADDRESSEE: NIXON & VANDERHYE P.C. 

STREET: 1100 NORTH GLEBE ROAD 

CITY: ARLINGTON 

STATE: VA 

COUNTRY: USA 

ZIP: 22201 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/142 , 583A 

FILING DATE: 29-Oct-1998 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB97/ 00658 

FILING DATE: ll-MAR-1997 

APPLICATION NUMBER: GB 9605124.8 

FILING DATE: ll-MAR-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: SADOFF, B-. J. 

REGISTRATION NUMBER: 36663 

REFERENCE/DOCKET NUMBER: 117-263 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 7038164000 

TELEFAX: 7038164100 
INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 777 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : both 

TOPOLOGY: linear 
MOLECULE TYPE: CDNA 
FEATURE : 

NAME /KEY: CDS 

LOCATION: 26. .493 
SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
US-09-142-583A-10 

Query Match 63.6%; Score. 332.8; DB 3; Length 777; 

Best Local Similarity 84.4%; Pred. No. 9.8e-95; 

Matches 444; Conservative 0; Mismatches 27; Indels 55; Gaps 4; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 



Db 




Qy 



61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 



Db 




Qy 



121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 



Db 




Qy 



181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 



I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

3CGCACCCCTCAAGCCTGCC7^GTC^GCTCGCTCTGTCCGTGCCC^GCGCCACACCGAC 4 



Db 


359 


TGCGCACCCCTCAAGCCTGCC7^GTC^GCTCGCTCTGTCCGTGCCC^GCGCCACACCGAC 


418 


Qy 


241 


ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 
1 M II 1 ! 1 1 1 1 1 1 1 


300 


Db 


419 


II II 1 1 i 1 1 1 1 III 

ATGCCCAAGACCCAG 


433 


Qy 


301 


AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAAa^GAACTA 

1 1 1 1 1 1 1 M 1 1 M 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 M II II i 1 II 1 II 1 1 1 II 1 1 II 1 II 1 

_ _ AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 


360 


Db 


434 


486 


Qy 


361 


CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 
1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 MINI 


420 


Db 


487 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 

CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 


546 


Qy 


421 


CTCTGCAC -AGTTACCTG -TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 


478 


Db 


547 


MINI III III II! Mill Mill! Ill 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 


606 


Qy ^ 


479 


TTCAAAGAT -GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 




Db 


607 


II MUM III lllllll Mill IIIIIIIIMMIII Mill 

TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 652. 





RESULT 4 
5405942-2 

Patent No. 54 05942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B . ; MERRYWEATHER , 

JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
I AND II 

NUMBER OF SEQUENCES : 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
SEQ ID NO: 2 : 

LENGTH: 622 
5405942-2 



Query Match 63.3%; Score 331.2; DB 6; Length 622; 

Best Local Similarity 68.3%; Pred. No. 2.8e-94; 

Matches 359; Conservative 84; Mismatches 28; Indels 55; Gaps 



4; 



Qy 



Db 



1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

MMMIMMIhhllll IhlllhlhMHhl-MhMhhhMMM 

45 GGACCGGAGACG CUCUGCGGGG CUGAG CUGGUGGAUG CUCUUCAGUUCGUGUGUGGAGAC 104 



Qy 

Db 

Qy 
Db 



61 AGGGGCTTTTATTTCAACAAGCCCAC^GGATACGGCTCC^GCAGTCGGAGGGC^CCTC^G 12 0 

lllll|::::h::|IMIIIIIIIIIII I I I I ' I II I I I I - I II II I I I Ihlll 
105 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUAUGGC^ 164 

121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I M M h I h 1 1 1 • 1 1 h I h I h : II II II h h I h I llllllhllllhhl 

165 ACAGGUAUCGUGGAUGAGUGCUG CUUC CGGAG CUGUGAUCAUAGGAGGCUGGAGAUGUAU 224 



Qy 


181 


TGTGCACCCCTCAAGCCGGCAAAGGC^GCCCGCTCCGTC^ 24 0 
1 MIIIM t 1 1 1 1 . 1 1 . 1 1 1 . 1 M 1 1 1 1 1 1 M 1 1 1 1 I 1 1 


Db 


225 


• 1 I i M I I 1 : II 1 I 1 1 11 111 M 11 III') 1 ' M 1 - ! 1 1 1 1 1 1 1 1 1 M i 

UGCGCACCCCUCAAGCCUGCCAAGUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 


284 


Qy 


241 


ATGCCCAAGACTCAGAAGTATC^GCC 


300 


Db 


285 


hlllllllll IN 


299 


Qy 


301 


AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 

||Mllhllh::|||IMI lllhlllllllhllllllllllllMhl 
AAGGAAGUACAUUUGAAGAAGGCAAGUAGAGGGAGUGC^GGAAACAAGAACUA 


360 


Db 


300 


352 


Qy 


361 


CAGGATGTAGGAAGAC C CTTCTGAGGAGTGAAGAAGGACAGG CCACCGCAGGACCCTTTG 


420 


Db 


353 


| | | | | : | : | | | | | | | | | | : | : | | | | | | : | | | | | | | | | | | | | | | || 1 1 1 1 1 h - 1 
CAGGAUGUAGGAAGACCCUCCUGAGGAGUGAAGAGUGACAUGCCACCGCAGGAUCCUUUG 


412 


Qy 


421 


CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 


478 


Db 


413 


hhllll HIM UNI = = 1111 III IMIMIhllh-lh Mh 

CUCUGCACGAGUUACCUGUUAAACUUUGGAACACCUACCAAA 


472 


Qy 


479 


TTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 




Db 


473 


: : 1 | M h Ml = : M 1 II 1 1 h II M M II 1 II h II II h M 
UUAAAAGAUGGGCGUUUCCCCCAAUGAAAUACACAAGUAAACAUUC 518 





RESULT 5 

US-08-472-809B-8 

; Sequence 8, Application US/08472 809B 

; Patent No. 5925564 

; GENERAL INFORMATION: 

APPLICANT: Schwartz, Robert J . 
; APPLICANT: DeMayo , Franco J. 

APPLICANT: O'Malley, Bert W. 

TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Lyon & Lyon 

STREET: 633 West Fifth Street 

STREET: Suite 4 7 00 

CITY: Los Angeles 

STATE: California 

COUNTRY: U.S.A. 

ZIP : 90071-2066 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 

MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: IBM P.C. DOS 5.0 

SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/472 , 809B 

FILING DATE: June 7, 1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846 



FILING DATE: March 9, 1994 

APPLICATION NUMBER: 07/789,919 

FILING DATE: No. 5925564ember 6, 1991 
ATTORNEY/ AGENT INFORMATION: 

NAME: Warburg, Richard J. 

REGISTRATION NUMBER: 32,327 

REFERENCE/DOCKET NUMBER: 214/212 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (213) 489-1600 

TELEFAX: (213) 955-0440 

TELEX: 67-3510 
; INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 5707 bases 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-472-809B-8 

Query Match 52.5%; Score 274.6; DB 2; Length 5707; 

Best Local Similarity 82.2%; Pred. No. 5.3e-76; 

Matches 351; Conservative 0; Mismatches 24; Indels 52; Gaps 1; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIIIMIIIIIIMMIII M: IMIIIIMI IIMIMIII IIIIMIM 

Db 7 93 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 8 52 

Qy 61 AGGGGCTTTTATTTC^CAAGCCCACAGGATACGGCT 120 

II I II II I II I II MM II II 1 1! II II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllll 

Db 853 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 912 

Qy 121 ACAGGCAT CGTGGATGAGTG CTGCTTC CGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 180 

IMMMMMIMMMMMMMIMMMMIIMM II II Ml 1 1 II I MM! 

Db 913 ACAGG CAT CGTGGATGAGTG CTG CTTCCGGAGCTGTGATCTAAGGAGG CTGGAGATGTAT 972 

Qy 181 TGTG(^CCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II II Ml II Ml MM II III MM Mill II II II II M M I II 1 1 1 II 1 1 1 1 

Db 973 TGCGCACCCCTC^GCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 1032 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTC 300 

Illllllllll III 

Db 1033 ATGCCCAAGACCCAG 1047 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACA,CAAGTAGAGGGAGTGCAGGAAAC^GAACTA 3 60 

i 1 1 1 II II I M 1 1 1 II 1 1 II II II I II II I II 

Db 1048 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 1100 

Qy 3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

MMMMMMMIMM MMIMMIMM MM | I I I I M I M II II I 
Db 1101 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCCCCG 1160 

Qy 421 CTCTGCA 427 

Mill 

Db 1161 GGCTGCA 1167 



RESULT 6 

US-08-472-809B-7 

; Sequence 7, Application US/08472809B 

; Patent No. 5925564 

; GENERAL INFORMATION: 

APPLICANT : Schwartz, Robert J . 

APPLICANT : DeMayo, Franco J. 

APPLICANT: O'Malley, Bert W. 

TITLE OF INVENTION: Expression Vector Systems and 
TITLE OF INVENTION: Method of Use 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Lyon & Lyon 

STREET: 633 West Fifth Street 

STREET: Suite 4700 

CITY: Los Angeles 

STATE: California 

COUNTRY: U.S.A. 

ZIP: 90071-2066 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 

MEDIUM TYPE: storage 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: IBM P.C. DOS 5.0 

SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/472 , 8 09B 

FILING DATE: June 7, 1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/209,846 

FILING DATE: March 9, 1994 

APPLICATION NUMBER: 07/789,919 

FILING DATE: No. 5925564ember 6, 1991 
ATTORNEY/AGENT INFORMATION: 

NAME: Warburg, Richard J. 

REGISTRATION NUMBER : 32,327 

REFERENCE/DOCKET NUMBER: 214/212 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (213) 489-1600 

TELEFAX: (213) 955-0440 

TELEX: 67-3510 
; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 6345 bases 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-472-809B-7 



Query Match 52.5%; 
Best Local Similarity 82.2%; 
Matches 351; Conservative 



Score 274.6; DB 2; Length 6345; 
Pred. No. 5.6e-76; 
0; Mismatches 24; Indels 52; Gaps 1; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

MINIUM III MINN IIIIIIIIIMIIMIIIIIIIMIIIMIIIIIMIII 



Db 



37 02 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 3761 



Qy 61 AGGGGCTTTTATTTC^C^GCCCAC^GGATACGGCTCCAGC^GTCGGAGGGCACCTCAG 12 0 

IMIIIIIIIIIIIIIIIIIIIIIIIIII II llllllllllll I lllllll 1 1 II 1 1 

Db 3762 AGGGGCTTTTATTTO^CAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 3821 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Mill Mill Illlllllllillllllll llllllllllll III llllllllllllll 

Db 3822 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 38 81 

Qy 181 TGTGCACCCCTC^GCCGGOU\AGGCAGCCCGCTCCGTCCGTGCC(^GCGCC^C^CCGAC 24 0 

II II Mill II 1 1 III II 111 MM Mill IMMMMMIMIMMIMM 

Db 3 8 82 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 3 941 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACC^CAAGAAAATGAAGTCTCAGAGG 3 00 

Milium 

Db 3 942 ATGCCCAAGACCCAG 3956 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAG 3 60 

1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 III I! I II I . 

Db 3957 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 4 00 9 

Qy 3 61 CAGGATGTAGGAAGACC CTTCTGAGGAGTGAAGAAGGACAGG C CACCG CAGGACCCTTTG 42 0 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 MMIMMIMM MM llllllllllll II I 

Db 4 010 CAGGATGTAGGAAGACC CTCCTGAGGAGTGAAGAGTGACATGCCACCG CAGGATC C CC CG 4 069 

Qy 421 CTCTGCA 427 

Mill 

Db 4070 GGCTGCA 4 076 



RESULT 7 
5405942-13 
Patent No. 5405942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B . ; MERRYWEATHER , 
JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 63 0,557 
FILING DATE: 19-JUL-1984 
SEQ ID NO: 13 : 

LENGTH: 357 
5405942-13 

Query Match 44.8%; Score 234.4; DB 6; Length 357; 

Best Local Similarity 93.8%; Pred. No. 5.1e-64; 

Matches 244; Conservative 0; Mismatches 16; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

II 1 1 M 1 1 1 1 1 1 1 1 M 1 1 II I Mill 1 1 lllllll I lllllll III MUM II Ml II 

Db 4 3 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 102 



Dv 


61 


Db 


103 


Dv 


121 


Db 


163 


Qy 


181 


Db 


223 


Qy 


241 


Db 


283 



I I I I I 1 M I t t I I t I I 1 I I I I I I I t I I I t II V : , " : llllll 

AGGGGCTTTTATTTC^O^GCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 

ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 

Mill IIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIII MMIIIIIIIMIIM 

ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 
TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 

II MM I MM I Ml II III MM Mill II II II I II 1 1 II 1 1 1 II 1 1 M 1 1 

TGCGCACCCCTCAGGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 

ATGCCCAAGACTCAGAAGTA 26 0 
Illllllllll MUM I 



RESULT 8 
5405942-9 

Patent No. 5405942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B . ; MERRYWEATHER , 

JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 63 0,557 
FILING DATE: 19-JUL-1984 
SEQ ID NO: 9 : 

LENGTH: 3 57 
5405942-9 

Query Match 44.5%; Score 232.8; DB 6; Length 357; 

Best Local Similarity 76.5%; Pred. No. 1.6e-63; 

Matches 199; Conservative 44; Mismatches 17; Indels 0; Gaps 0; 
Qy l GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 1 II II I M M h h 1 1 M II M 1 1 h I h M I I M h M I h M h h h 1 1 1 1 1 1 

Db 43 GGACCGGAGACGCUCUG CGGGG CUGAGCUGGUGGACG CUCUUCAGUUCGUGUGUGGAGAC 102 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGG 12 0 

. : I IIMIIIIIIhlllMIM 

Db 103 AGGGGCUUUUAUUUCAACAAGCCC^CAGGGU 162 

Q Y 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Mill hlhllhllhlhl -I \:\^\\^\^ MIMIhllllhhl 

Db 163 ACAGGUAUCGUGGAUGAGUGCUGUUUCCGGAGCUGUGAUCUAAGGAGGCUGGAGAUGUAU 222 

Qy 181 TGTGCACCCCTCAAGCCGGC^VAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

: | llllllhllllll II Ml MM llhl I U I I H I I I I 1 I I I I I I I I I I I I 

Db 223 UGCGC^CCCCUCAAGCCUGCC^GUCAGCUCGCUCUGUCCGUGCCCAGCGCCACACCGAC 282 

Qy 241 ATGCCCAAGACTCAGAAGTA 260 

hIMIIIIII llllll I 



Db 



283 AUGCCCAAGACCCAGAAGGA 302 



RESULT 9 
5405942-7 

/Patent No. 5405942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B . ; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN- LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT. APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 7: 

LENGTH: 210 
5405942-7 



Query Match 36.6%; Score 191.4; DB 6; Length 210; 

Best Local Similarity 75.1%; Pred. No. 1.2e-50; 

Matches 157; Conservative 41; Mismatches 11; Indels 0; Gaps 0; 

Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

II I I MM I I I h h II I I Ih MM: I hi I h I hi- I lh:| h h h I I I || 

Db 1 GGACCGGAGACG CUCUG CGGGGCUGAG CUGGUGGAUGCUCUUCAGUUCGUGUGUGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTC 120 

I i II M- M ^ M II M M 1 1 M II I M 1 1 h 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 Ihlll 

Db 61 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUA 120 

Qy 121 ACAGGCATCGTGGATGAGTG CTG CTTCCGGAG CTGTGAT CTGAGGAGGCTGGAGATGTAC 180 

Mill , ■ M I I II I! hi I II hhl 

Db 121 ACAGGUAUCGUGGAUGAGUG CUG CUUCCGGAG CUGUGAUCUAAGGAGGCUGGAGAUGUAU 18 0 



Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 209 

M 1 1 MUM MM II II! 1 1 1 1 

Db 181 UGCGCACCCCUCAAGCCUGCCAAGUCAGC 20 9 



RESULT 10 

5405942-11. 

; Patent No. 5405942 

APPLICANT: BELL, GRAEME I.; RALL, LESLIE B .; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 

FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 630,557 

FILING DATE: 19-JUL-1984 
;SEQ ID NO: 11: 



LENGTH: 210 
5405942-11 



Query Match 36.6%; Score 191.4; DB 6; Length 210; 

Best Local Similarity 94.7%; Pred. No. 1.2e-50; 

Matches 198; Conservative 0; Mismatches 11; Indels 0; Gaps 0 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I II 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 ' J MM 1 1 1 1 1 II 1 1 1 1 1 1 1! I II 1 1 1 M 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

Ml I II I M I II Ml I M M 1 1 II 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 I Mill 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 12 0 

Qy 121 ACAGGCATCGTGGATGAGTG CTG CTTC CGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Mill I M I IMM I II M I 1 1 1 Ml MUM 1 1 1 1 1 M 1 1 1 M 1 1 M I 

Db 121 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 2 09 

II MIMMIMM II Ml 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGC 2 09 



RESULT 11 
US-09-255-829-13 

Sequence 13, Application US/09255829 
Patent No. 6461617 
GENERAL INFORMATION: 

APPLICANT: Shone, Clifford Charles 
APPLICANT: Quinn, Conrad Padraig 
APPLICANT: Foster, Keith Alan 

TITLE OF INVENTION: Recombinant Toxin Fragments 
NUMBER OF SEQUENCES: 2 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: STERNE, KESSLER, GOLDSTEIN, & FOX P.L.L.C. 
STREET: 1100 NEW YORK AVENUE , NW, SUITE 600 
CITY: WASHINGTON 
STATE : DC 
• COUNTRY: USA 
ZIP: 20005-3934 
COMPUTER READABLE FORM : 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/255 , 82 9 
FILING DATE: 23-FEB-1999 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/GB97/02273 
FILING DATE: 22-AUG-1997 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/782,893 
FILING DATE: 27-DEC-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: ESMOND, ROBERT W. 



REGISTRATION NUMBER : 32,8 93 
REFERENCE / DOCKET NUMBER: 1581.0130002 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-371-2600 
TELEFAX: 202-371-254 0 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2862 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE : 

NAME /KEY : CDS 
LOCATION: 1..2862 
US-09-255-829-13 



Query Match 36.6%; Score 191.4; DB 4; 

Best Local Similarity 94.7%; Pred. No. 4.8e-50; 
Matches 198; Conservative 0; Mismatches 11; 



Length 28 62; 
Indels 0; 



Gaps 



0; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 II I 1 1 1 1 1 1 M I II I MM 1 1 1 1 1 1 1 1 I II 1 1 1 1 Ml I M I II I 1 1! I 

Db 2644 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 2703 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCAC 12 0 

'IIIIIIIIIIIIIIIIIIIIIIIIIIIII II Illlllllllllllllllll I I I I I I 
Db 27 04 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 2763 



Qy 121 ACAGGCATCGTGGATGAGTG CTG CTTC CGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I II I I lllllllllllllllll 
Db 2764 ACAGGTATCGTGGATGAGTG CTG CTTC CGGAG CTGTGATCTAAGGAGGCTGGAGATGTAT 2823 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 2 09 

I MUM Ml Mill I III III 

Db 2824 TGCGCACCCCTCAAGCCTGCCAAGTCAGC 2 852 



RESULT 12 

5405942-15 

; Patent No. 5405942 

APPLICANT: BELL, GRAEME I . ; RALL , LESLIE B . ; MERRYWEATHER , 
; JAMES P. 

TITLE OF INVENTION: PREPRO INSULIN-LIKE GROWTH FACTORS 
;I AND II 

NUMBER OF SEQUENCES: 16 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/65,673 
FILING DATE: 16-JUN-1987 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 63 0,557 
FILING DATE: 19-JUL-1984 
;SEQ ID NO: 15: 

LENGTH: 210 
5405942-15 

Query Match 36.3%; Score 189.8; DB 6; Length 210; 



Best Local Similarity 74.6%; Pred. No. 3.8e-50; 

Matches 156; Conservative 41; Mismatches 12; Indels 0; Gaps 



0; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 6 0 

MINI IIMhh II I I : : I " I I ^ I I I - ■ ! " I " M I I - ^ I I " I ^ I : I I I I I 

Db 1 GGACCGGAGACGCUCUG CGGGGCUGAG CUGGUGGAUG CUCUUCAGUUCGUGUGUGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCAC^GGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

MINI: : = :| = :: I III III Mill III I I I I • I I I I I I I ' I I I I I I II Ihlll 
Db 61 AGGGGCUUUUAUUUCAACAAGCCCACAGGGUA 12 0 

Qy 121 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

I I I I I h I h I I h I I I : I I : I h : I I I I I I h I : I I : h llllllhllllhhl 
Db 121 ACAGGUAUCGUGGAUGAGUGCUGCUUC CGGAGCUGUGAUCUAAGGAGG CUGGAGAUGUAU 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGC 2 09 

M IMMIhll III II III MM 

Db 181 UGCGCACCCCUCAGGCCUGCCAAGUCAGC 209 



RESULT 13 
US-08-308-196A-1 

Sequence 1, Application US/08308196A 
Patent No. 5612198 
GENERAL INFORMATION: 

APPLICANT: Brierley, Russell A. 
APPLICANT: Davis, Geneva R. 
APPLICANT: Holtz, Gregory C. 
APPLICANT: Gleeson, Martin A. 
APPLICANT: Howard, Bradley D. 

TITLE OF INVENTION: Production of Insulin-Like Growth 
TITLE OF INVENTION: Factor-1 in Methylotrophic Yeast Cells 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Brown, Martin, Haller & McClain 
STREET: 166 0 Union Street 
CITY: San Diego 
STATE: California 
COUNTRY : USA 
ZIP: 92101-2926 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/308 , 196A 
FILING DATE: 09-SEPT-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/983,523 
FILING DATE: 03-MAR-1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/578,728 
FILING DATE: 04-SEP-1990 
ATTORNEY/AGENT INFORMATION: 
NAME: Seidman, Stephanie L. 
REGISTRATION NUMBER: 33,779 



REFERENCE / DOCKET NUMBER: 51875 
TELECOMMUNICATION INFORMATION : 

TELEPHONE: (619)238-0999 

TELEFAX: (619)238-0062 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 24 0 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY : unknown 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: CDS 

LOCATION: 14..232 
US-08-308-196A-1 



Query Match 35.4%; Score 18 5.2; DB 1; Length 240; 

Best Local Similarity 91.6%; Pred, No. l.le-48; 

Matches 196; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIIIIIIIIIMIMIMI MINIM 

Db 17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTC 12 0 

I III 1 1 MM 1 1 MINI Ml 1 1 MUM II Ml II MM MM! MM I Mill 

Db 77 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGACGGGCGCCTCAG 136 

Qy 121 ACAGGCATCGTGGATGAGTG CTGCTTC CGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

MM I II III II Mill I MM I MM 1 1 1 1 1 1 1 1 II II II MM Ml I I II I III I 

Db 137 ACAGGCATCGTGGATGAGTG CTG CTTCCGGAGCTGTGATCTAAGGAGGCT CGAGATGTAT 196 

Qy ■ 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCT 214 

II M MUM MUM M III MM II 

Db 197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 23 0 



RESULT 14 
PCT-US91-06452-1 

Sequence 1, Application PC/TUS9106452 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Brierley, Russell 
Davis, Geneva R. 
Holtz, Gregory C. 
Gleeson, Martin A. 
Bradley, D. H. 



TITLE OF INVENTION: Production of Insulin-Like Growth 
TITLE OF INVENTION: Factor-1 in Methyl o trophic Yeast Cells 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Fitch, Even, Tabin & Flannery 

STREET: 135 South LaSalle Street, Suite 900 

CITY: Chicago 

STATE: Illinois 

COUNTRY : USA 

ZIP: 60603 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US91/06452 
FILING DATE: 19910409 
CLASSIFICATION: 43 5 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/578,728 
FILING DATE: 04-SEP-1990 
ATTORNEY/ AGENT INFORMATION: 
NAME: Seidman, Stephanie L. 
REGISTRATION NUMBER: 33 , 77 9 
REFERENCE/DOCKET NUMBER: 518 74 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (619)552-1311 
TELEFAX: (619)552-0095 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 240 base pairs 
TYPE: NUCLEIC ACID 
STRANDEDNESS : double 
TOPOLOGY : unknown 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 14.. 232 
PCT-US91-06452-1 

Query Match 35.4%; Score 185.2; DB 5; Length 240; 

Best Local Similarity 91.6%; Pred. No. l.le-48; 

Matches 196; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

II 1 1 1 Ml 1 1 1 II 1 1 1 1 1 MINIM lllllllllll II : : -M 

Db 17 GGACCGGAGACGCTCTGCGGGGCTGAGCTCGTGGATGCTCTGCAGTTCGTGTGTGGAGAC 76 

Qy 61 AGGGGCTTTTATTT(^CAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

III llllll Illlllllllllllllllll II II Mill IIMI II MM 1 1 III I 

Db 77 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCOVGCAGTCGACGGGCGCCTCAG 136 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

M . ■ M M : MMIMI MIMIII 

Db 137 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTCGAGATGTAT 196 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCT 214 

II IMIIIIIMIIII II III MM I I 

Db 197 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 23 0 



RESULT 15 
US-09-029-267-13 

; Sequence ,13, Application US/09029267 

; Patent No. 6107057 

; GENERAL INFORMATION: 

APPLICANT: Crawford, Kenneth 



APPLICANT: Zaror, Isabel 
APPLICANT: Innis, Michael 

TITLE OF INVENTION: Pichia Secretory Leader for Protein 
TITLE OF INVENTION: Expression 
NUMBER OF SEQUENCES: 4 0 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Chiron Corporation 

STREET: 4560 Horton Street 

CITY: Emeryville 

STATE: California 

COUNTRY: United States 

ZIP: 94608 
COMPUTER READABLE FORM: 
; . MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/029,267 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
; NAME : Chung, Ling-Fong 

REGISTRATION NUMBER: 36,482 

REFERENCE/DOCKET NUMBER: 1165.100 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (510) 601-2704 

TELEFAX: (510) 655-3 542 
; INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 390 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : S ingl e 

TOPOLOGY : 1 inear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "Synthetic" 
US-09-029-267-13 

Query Match 35.4%; Score 185.2; DB 3; Length 390; 

Best Local Similarity 91.6%; Pred. No. 1.5e-48; 

Matches 196; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 



Db 



160 




Qy 



61 



AGGGGCTTTTATTTCAACAAGCCC^CAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 



Db 



220 




Db 



Qy 



280 



121 




Qy 



181 



TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCT 214 



Db 



340 



TGCGCACCCCTCAAGCCTGCCAAGTCAGCTTGAT 373 




Search completed: December 13, 2003, 11:44:50 
Job time : 49.3585 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: December 13, 2003, 07:29:55 ; Search time 233.512 Seconds 

(without alignments) 
7443.919 Million cell updates/sec 

Title: US-09-852-261-5 
Perfect score: 523 

Sequence: 1 ggaccggagacgctctgcgg aaatacacaagtaaacattc 523 



Scoring table: I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 2201672 seqs, 1661799599 residues 

Total number of hits satisfying chosen parameters: 4403344 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA : * 

1 : /cgn2_6/ptodata/2/pubpna/US07__PUBCOMB . seq : * 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB . seq: * 

3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB . seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB . seq : * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB . seq : * 

6 : /cgn2__6/ptodata/2/pubpna/PCTUS_PUBCOMB. seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq : * 

8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq:* 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB. seq:* 

10 : /cgn2_6/ptodata/2/pubpna/US09B__PUBCOMB . seq: * 

11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB .seq: * 

12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB . seq : * 

13 : /cgn2__6/ptodata/2/pubpna/US09_NEW_PUB . seq2 : * 

14 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB . seq : * 

15 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB . seq : * 

16 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB . seq : * 

17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq: * 

18 : /cgn2_6/ptodata/2/pubpna/US60_PUBC0MB .seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

o 
o 

Result Query 

No. Score Match Length DB ID Description 
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ALIGNMENTS 



RESULT 1 
US-09-852-261-5 

; Sequence 5, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 



APPLICANT: TERENGHI , GIORGIO 
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 
FILE REFERENCE: 117-351 

CURRENT APPLICATION NUMBER: US/ 09/ 852 , 261 
CURRENT FILING DATE: 2001-05-10 
PRIOR APPLICATION NUMBER : GB 0011278.9 
PRIOR FILING DATE: 2000-05-10 
NUMBER OF SEQ ID NOS: 14 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 5 
LENGTH: 523 
TYPE: DNA 

ORGANISM: Oryctolagus cuniculus 
US-09-852-261-5 

Query Match 100.0%; Score 523; DB 9; Length 523; 

Best Local Similarity 100.0%; Pred. No. . 1. 9e-161; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

1 1 1 M I IM M I II M II II Ml 1 1 Ml I II 1 1 1 M 1 1 1 1 II M 1 1 II II 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCAC^GGATACGGCTCCAGCAGTCGG 120 

IIIIIMIIIIIIIIIII Ml III Mill I IIIIMIIIMII MINIM MINIMI 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC^GCAGTCGGAGGGCACCTCAG 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

'M M 1 1 1 1 1 IM II I 1 1 1 Ml 1 1 1 M II 1 1 1 M I III II Ml 1 1 1 1 II II 

Db 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCA.CACCGAC 240 

1 1 1 1 II III 1 1 1 M I MM I II II I MM 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 II 1 1 

Db 181 TGTGCACCCCTC^GCCGGCAAAGGCAGCCCGCTCCGTCCGTGCC(^GCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAAC 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

Db 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAA 300 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGA 360 

IMMIIIIIIIIII i MMMMMMMMMMMIMM! ' IMMIII 

Db 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGC^GGAAAa^GAACTA 360 

Qy 361 CAGGATGTAGGAAGACCCTT CTGAGGAGTGAAGAAGGACAGGC CACCG CAGGAC CCTTTG 420 

1 1 1 1 II M 1 1 1 1 1 M II M I II M 1 1 Ml II I M III 1 1 1 1 1 , II 1 1 1 II 1 1 

Db 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGC CACCG CAGGACCCTTTG 420 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCG^^ 480 

1 1 1 1 1 1 1 1 1 ill . I III 1 1 Ml I M I MM I M 1 1 1 II 1 1 1 M 1 1 1 1 1 II MM 

Db 421 CTCTGC^CAGTTACCTGTAAAO^TTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

1 1 1 II 1 1 Mill I III 1 1 Ml I II I MM 1 1 1 1 1 1 II I 

Db 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 



RESULT 2 



US-09-852-261-1 

; Sequence 1, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI , GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/09/852 , 261 

; CURRENT FILING DATE: 2001-05-10 

; PRIOR APPLICATION NUMBER: GB 0011278.9 

; PRIOR FILING DATE: 2000-05-10 

; NUMBER OF SEQ ID NOS : 14 

; SOFTWARE: PatentlnVer. 2.1 

; SEQ ID NO 1 

LENGTH: 517 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-852-261-1 

Query Match 89.4%; Score 467.4; DB 9; Length 517; 

Best Local Similarity 96.2%; Pred. No. 3.8e-143; 

Matches 501; Conservative 0; Mismatches 16; Indels 4; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

I I I I I I I I I III Mill I I i I I Ml I I M • I Ml 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTC^CAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGC^CCTCAG 12 0 

I Ml III II I Ml II I II I II III II III II 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 12 0 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

MMMMMMMMMMMMMIMMMIMIMM II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

Db 121 ACAGGCATCGTGGATGAGTG CTGCTTC CGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II 1 1 II 1 1 MM II II II III MM Mill Ml MUM I II Ml III 1 1 II II 

Db 181 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATGCCCAAGACTCAGAAGTATC^GCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

1 1 MUM III IIIIIIIIIIMM IIIMIIIIMIIIIM I MM Mill I 

Db 241 ATGCCCAAGACCCAGAAGTATCAGCCCCCATCTACCAACA^ G 2 97 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I II I I I I I I M I I I I II I M I II II I I I M II I I I I I I II II I II II M I II I II I I II 
Db 2 98 AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 357 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

1 1 1 1 1 i M I MMMMMMMMMMMMMMMMMMMMMIMIMM 

Db 358 CAGGATGTA - GAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 416 

Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGAT(^ 48 0 

I I I II I I I I II II M I II I I I I II I II I II I I I I II M II II I I I II I I II I I I I II I I I 
Db 417 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCAAAAAATAAGTTTGATC^CATTT 476 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACAT 521 



IMMMMMMMMMMMMIMMMMMIIIM 

Db 477 CAAAGATGG CATTTCC CC CAATGAAATACACAAGTAAACAT 517 



RESULT 3 

US-09-852-261-13 

; Sequence 13, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDS PINK, GEOFFREY 

; APPLICANT: TERENGHI , GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/ 09/852 , 261 
; CURRENT FILING DATE: 2001-05-10 

PRIOR APPLICATION NUMBER : GB 0011278.9 
; PRIOR FILING DATE: 2000-05-10 
; NUMBER OF SEQ ID NOS : 14 
; SOFTWARE: Patent In Ver. 2.1 
; SEQ ID NO 13 

LENGTH: 471 

TYPE: DNA 
; ORGANISM: Oryctolagus cuniculus 
US-09-852-261-13 

Query Match 78.2%; Score 4 09; DB 9; Length 471; 

Best Local Similarity 90.1%; Pred. No. 6.1e-124; 

Matches 471; Conservative 0; Mismatches 0; Indels 52; Gaps 1; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Ml llllll MIMI MMMI IMIM MUM MMIMMMMMIMM 

Db 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

IIMIIMIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIMIIIIIIIIIIIIIIIIII 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

Qy 121 ACAGG CATCGTGGATGAGTG CTGCTTCCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

IIIIIIIIIIIIIIMIIIIIIIIIMII III Mill Mill IMIIMM MIMM 

Db 121 ACAGG CATCGTGGATGAGTGCTG CTT C CGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 240 

III IIIMII Mill MINI: Mill MINI MM II MM II Mill 

Db 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

Qy 241 ATG CCCAAGACTCAGAAGTATCAG CCTCCAT CTACCAACAAGAAAATGAAGTCT CAGAGG 300 

MMMMMIMM 

Db 241 ATGCCCAAGACTCAG 255 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGA 360 

1 1 : 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 I M I M 1 1 1 1 1 1 1 1 ill! Mill I 

Db 256 AAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 308 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 420 

1 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 

Db ■ 3 0 9 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 368 



Qy 421 CTCTGCACAGTTACCTGTAAAC^TTGGAATACCGGCCAAAAAATAAGTTTGATCACATTT 480 

IIMIIIIMIIIIIIIIIIIIIIIMMMMIIMIIIIMMIIIIIMMIIIIM 

Db 3 69 CTCTGCACAGTTACCTGTAAACATTGGAATACCG 428 

Qy 481 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

Ml II III II MUM III III II Mill II Ml I Mill Ml 

Db 42 9 CAAAGATGGCATTTCCCCCAATGAAATACACAAGTAAACATTC 471 



RESULT 4 
US-09-852-261-3 

Sequence 3, Application US/09852261 
Patent No. US20020083477A1 
GENERAL INFORMATION: 
APPLICANT: GOLDSPINK, GEOFFREY 
APPLICANT: TERENGHI , GIORGIO 
TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 
FILE REFERENCE: 117-351 

CURRENT APPLICATION NUMBER: US/ 09/8 52 , 2 61 
CURRENT FILING DATE: 2001-05-10 
PRIOR APPLICATION NUMBER: GB 0011278.9 
PRIOR FILING DATE: 2000-05-10 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: Patentln Ver . 2.1 
SEQ ID NO 3 
LENGTH:" 53 9 
TYPE : DNA 

ORGANISM: Rattus sp . 
US-09-852-261-3 

Query Match 68.2%; Score 356.8; DB 9; Length 539; 

Best Local Similarity 82.3%; Pred. No. 9.9e-107; 

Matches 436; Conservative 0; Mismatches 87; Indels 7; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill 1 IIMIMIIIIIMMIIIII 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

1 1 1 M 1 1 II 1 1 MIMMMMMMI II MIMIIIII MIMMMIM Ml 

Db 61 AGGGGCTTTTACTTCAACAAGCCCACAGTCTATGGCTCCAGCAT 120 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

I f Mill MIMMIMI IIIIIIIIMIMIIIIIMIIMIIIIIIMIMIIM 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

Qy 181 TGTGCACCCCT(^AGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

MM II MM II Mill MM II III MM IIM IIIIMIIII Ml 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Qy 241 ATGCCCAAGACTCAG^GTATC^GCCTCC^TCTACCAACAAGAAAATGAAGTCTCAGAG^ 300 

MMIMMMMIIMM Mill I Ml II MIIIIMI MM II Ml 

Db 241 ATGCCCAAGACTCAGAAGTCCCAGCCCCTATCGACACACAAGAAAAGGAAGCTGCAAAGG 3 00 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

1 1 1 1 1 1 i 1 1 i ! 1 1 1 1 1 1 1 MMMIMMMMIMM MMMIMMIMM III 

Db 3 01 AGAAGGAAAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 3 60 



Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGC CAC CGCAGGAC CCTTTG 420 

III llllllll II III I Mill I M I II I I II II 1 1 M MMI I 

Db 361 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 420 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II I llllll MM III MM M III MM II 

Db 421 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGTTCAATA 48 0 

Qy 475 ACATTTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

III II 1 1 Mil llllllllll IIIIIMMMIIIIIIIIIIIMI 

Db 481 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 530 



RESULT 5 
US-10-161-088-1 

Sequence 1, Application US/10161088 
Publication No. US2 003 0077761A1 
GENERAL INFORMATION: 
APPLICANT: Parrow, Vendela 
APPLICANT: Rosengren, Linda 
TITLE OF INVENTION: NEW METHODS 
FILE REFERENCE: 13425-111001 
CURRENT APPLICATION NUMBER : US/10/161 , 088 
CURRENT FILING DATE: 2002-05-31 
PRIOR APPLICATION NUMBER: SE 0101934-8 
PRIOR FILING DATE: 2001-06-01 
NUMBER OF SEQ ID NOS : 3 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 1 
LENGTH: 651 
TYPE : DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (73) . . . (471) 
US-10-161-088-1 

Query Match 66.8%; Score 349.4; . DB 15; Length 651; 

Best Local Similarity 82.8%; Pred. No. 3e-104; 

Matches 425; Conservative 0; Mismatches 81; Indels 7; Gaps 2; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill II Mill MMMMMMIMMMMMMMMMIMM 

Db 13 9 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGACCG 198 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC^GCAGTCGGAGGGCACCTCAG 12 0 

1 1 1 II M M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II IMIIIIIII MMMMMMIMI 

Db 199 AGGGGCTTTTACTTCAACAAGCCCACAGGCTATGGCTCCAGCATTCGGAGGGCACCTCAG 258 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

llllllll llllll I MM I II MM II MUM MUM II Ml II II MM 1 1 II 

Db 259 ACAGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGACTGGAGATGTAC 318 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTC 24 0 

MMI II II Mill I II I II II M II M I II MMI MUM II II 1 1 Ml 

Db 319 TGTGCCCCACTGAAGCCTACAAAAGCAGCCCGCTCTATCCGTGCCCAGCGCCACACTGAC 378 



Qy 241 ATGCCC^GACTCAGAAGTATCAGCCTCCATCTACC^ACAAGAAAATGAAGTCTCAGAGG 3 00 

IIIIIMIIIIIMMIII I I I I III II llllllllll 1 1 1 1 II III 

Db 37 9 ATGCCCAAGACTCAGAAGTCCCCGTCCCTATCGACAAACAAGAAAACGAAGCTGCAAAGG 43 8 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I I I I I I I I I I I II I I I I I I I I I I I I II II I I I II I I II I I I I I I I I I I I I I II I I III 

Db 43 9 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 4 98 

Qy 361 CAGGATGTAGGAAGACCCTT CTGAGGAGTGAAGAAGGACAGG CCAC CG CAGGACCCTTTG 42 0 

Ml llllllll II MM MM I II I II Mill Mill llllll 

Db 49 9 CAGAATGTAGGAGGAGCCTCCCACGGAGCAGAAAATGCCACATCACCGCAGGATCCTTTG 558 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II I llllll I II III III II lllllll II 

Db 559 CTGCTTGAGCAACCTGCAAAACATCGAAACACCTACCAAAT^ 618 

Qy 4 75 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAA 506 

Mill II Mill II II MM I Mill I MM 

Db 619 ACATTACAAAGATGGGCATTTCCCCCAATGAAA 651 



RESULT 6 

US-09-919-497-24 

Sequence 24, Application US/09919497 
Patent No. US20020106662A1 
GENERAL INFORMATION : 
APPLICANT: Mutter, George L. 

TITLE OF INVENTION: PROGNOSTIC CLASSIFICATION OF ENDOMETRIAL CANCER 
FILE REFERENCE: B0801/7225 

CURRENT APPLICATION NUMBER: US/09/919,4 97 
CURRENT FILING DATE : 2001-07-31 
PRIOR APPLICATION NUMBER: US 60/221,735 
PRIOR FILING DATE: 2000-07-31 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 24 
LENGTH: 72 60 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-919-497-24 

Query Match 63.9%; Score 334.4; DB 10; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4 

Qy 1 GGACCGGAGACG CTCTG CGGTG CTGAG CTGGTGGATG CTCTTCAGTTCGTGTGTGGAGAC 60 

I ■ I 1 i I : I : ' 

Db 311 GGACCGGAGACG CTCTG CGGGGCTGAGCTGGTGGATG CTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 120 

II 'I I II 1 1 1 1 1 1 1 M III 1 1 Ml II II Mil. llllll lllllll llllll 

Db 3 71 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGGCAT CGTGGATGAGTG CTG CTTC CGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

I! IIIIMIIIIIII Mill II IIIIIMIilllMIIII MINI MINIUM 

Db 431 ACAGGCATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 4 90 



Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II II II I III MINI II III MM Mill llllllllllllllllllllllll 

Db 4 91 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATG CC CAAGACTCAGAAGTATCAG C CTCCAT CTACCAACAAGAAAATGAAGTCTCAGAGG 300 

Illllllllll III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGC^GGAAAC^GAACTA 360 

I I I I ! I I I ! M I I I I I I II Mill II I III II MUM II III II I II 

Db 566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

E 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 IIIIIIIIIMIII MM IMMMIMM MUM 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

IIIIIIII lllllll Mill MUM III I IN' 1 1 III 1 1 1 1 MM 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 73 8 

Qy 479 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II llllll III IIIIIIII MUM lllllll Mill I 

Db 739 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 



RESULT 7 

US-09-880-107-3739 

; Sequence 3739, Application US/09880107 
; Patent No. US20020142981A1 
; GENERAL INFORMATION: 

APPLICANT: Home, Darci T. 
; APPLICANT: Vockley, Joseph G. 
; APPLICANT: Scherf, Uwe 

APPLICANT: Gene Logic, Inc. 

TITLE OF INVENTION: Gene Expression Profiles in Liver Cancer 
; FILE REFERENCE: 44 92 1 -5 028 -WO 
; CURRENT APPLICATION NUMBER: US/09/88 0,107 
; CURRENT FILING DATE: 2001-06-14 
; PRIOR APPLICATION NUMBER: US 60/211,379 
; PRIOR FILING DATE : 2000-06-14 
; PRIOR APPLICATION NUMBER: US 60/237,054 
; PRIOR FILING DATE: 2000-10-02 
; NUMBER OF SEQ ID NOS : 3950 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 3739 

LENGTH: 7260 

TYPE: DNA 

ORGANISM : Homo sapiens 
FEATURE: 

OTHER INFORMATION: Genbank Accession No. US2002 0142981A1 X57025 
US-09-880-107-3739 



Query Match 63.9%; Score 334.4; DB 10; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 



4 



Qy 


1 


GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 

MIIIIIIMI! MINIM Ml 1 I I'l 1 1 1 1 Tl 1 1 ! I 1 

GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 


60 


Db 


311 


370 


Qy 


61 


AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 


120 


Db 


371 


illlllllMIMIIIIIIIIIIilllll 1! IIIIIIIIIIIIIIMIIII MINI 

AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 


430 


Qy 


121 


ACAGG CAT CGTGGATGAGTG CTG CTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 


180 


Db 


431 


MIM 1 1 1 1 1 1 Ml Ml 1 Ml 1 1 Mill 1 : 1 

ACAGGCATCGTGGATGAGTG CTGCTT CCGGAGCTGTGAT CTAAGGAGGCTGGAGATGTAT 


490 


Qy 


181 


TGTGCACCCCTCAAGCCGGO^GGCAGCCCGCTCCGTCCGTGCCC^GCGCCACACCGAC 


240 


Db 


491 


II MIMIIMMMI II III MM II Ml III MM Mill MUM MUM 

TGCGC^CCCCTCmGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 


550 


Qy 


241 


ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACC^C^GAAAATGAAGTCTCAGAGG 


300 


Db 


551 


MIIIIIIMI III 

ATGCCCAAGACCCAG ■ 


565 


Qy 


301 


AGAAGGAAAGG^UVGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 


360 


Db 


566 


II III II III III II MM II 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 i 1 ! 1 1 1 1 1 1 i 1 1 1 1 1 1 

AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTGCAGGAAACAAGAACTA 


618 


Qy 


361 


CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 


420 


Db 


619 


IMM II II Ml II Mill III Ml II III Ml 1 1 II 1 M II M II 1 1 1 II II II 

CAGGATGTAGGAAGACC CTCCTGAGGAGTGAAGAGTGACATG CCACCGCAGGATCCT.TTG 


678 


Qy 


421 


CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 

1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 I 1 1 1 1 1 


478 


Db 


679 


1 II II 1 M 1 M 1 1 1 II 1 Mill II II II III 1 1 1 1 1 1 II II II 1 M 1 II II II 

CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 


738 


Qy 


479 


TTCAAAGAT -GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 




Db 


739 


M llllll III MM II II III MM II III III III MM II 

TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 




RESULT 8 
US-09-873 


-319 


-707 





Sequence 707, Application US/09873319A 
Publication No. US2003 0134324A1 
GENERAL INFORMATION: 

APPLICANT: Munger, William E. 

APPLICANT: Kulkarni, Prakash 

APPLICANT: Getzenberg, Robert 

APPLICANT: Waga , Iwao 

APPLICANT: Yamamoto, Jun 

TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic 
TITLE OF INVENTION: Hyperplasia Using Gene Expression Profiles 
FILE REFERENCE: 44921-5029-US 
CURRENT APPLICATION NUMBER : US/09/873 , 319A 
CURRENT FILING DATE: 2001-06-05 
EARLIER APPLICATION NUMBER: US 60/223,323 
EARLIER FILING DATE: 2000-08-07 
NUMBER OF SEQ ID NOS : 755 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 707 



LENGTH: 7260 
TYPE: DNA 

ORGANISM : Homo sapiens 
FEATURE : 

OTHER INFORMATION: Genbank Accession No. US20030134324A1 X57025 
US-09-873-319-707 

Query Match 63.9%; Score 334.4; DB 13; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 



Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIMII lllllll: ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i . 1 1 1 1 1 1 1 1 1 M 1 1 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTC7WVCAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCT(^G 12 0 

II MMMIMIMM MUM lllllll II llllllllllllllllllll I MM I 

Db 3 71 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 43 0 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

II MUM IIIIIIIIMIMIMIII MMMIMIMM MM I II II I II I II 1 1 

Db 431 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 4 90 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II 1 1 1 1 1 1 1 i I i i 1 1 1 II III MM Mill MMMMMMMIMMIMM 

Db 4 91 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAGAAAATGAAGTCTCAGAGG 3 00 

I I I I I I I I I I III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 3 60 

; 1 1 1 1 1 ! I 1 1 1 1 1 1 ! I . I 1 1 1 1 M Mil 1 1 1 1 I i 11 1 M 1 1 1 1 1 1 1 

Db 566 AAGGAAGTAC^TTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 3 61 CAGGATGTAGGAAGAC CCTT CTGAGGAGTGAAGAAGGACAGGC CACCGCAGGACCCTTTG 42 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 i I MIMIIIIIIMI MM llilllllll MUM 

Db 619 CAGGATGTAGGAAGAC CCTCCTGAGGAGTGAAGAGTGACATGC CACCGCAGGAT C CTTTG 678 

Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

I lllllll 1 1 1 1 1 1 1 1 1 MM I Ml 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 73 8 

Qy 479 TTCAAAGAT-GGC^TTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II Ml Ml 1 1 I i 1 1 III I i J 1 1 1 1 ; 1 1 1 E 1 1 1 1 I 

Db 73 9 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 



RESULT 9 

US-09-960-706-1066 

; Sequence 1066, Application US/09960706 

; Publication No. US20030134280A1 

; GENERAL INFORMATION: 

; APPLICANT: Munger, William E. 

TITLE OF INVENTION: Identifying Drugs for and Diagnosis of Benign Prostatic 
Hyperplasia Using 

; TITLE OF INVENTION: Gene Expression Profiles 



FILE REFERENCE: 4492 1 -5029-01US 
CURRENT APPLICATION NUMBER: US/ 09/ 960 , 706 
CURRENT FILING DATE : 2001-09-24 
PRIOR APPLICATION NUMBER: 60/223,323 
PRIOR FILING DATE: 2000-08-07 
PRIOR APPLICATION NUMBER: 09/873,319 
PRIOR FILING DATE: 2001-06-05 
NUMBER OF SEQ ID NOS : 1124 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 1066 
LENGTH: 7260 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

OTHER. INFORMATION: Genbank Accession No. US20030134280A1 X57025 
US-09-960-706-1066 

Query Match 63.9%; Score -334.4; DB 13; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 6 0 

llllllllllllll llllll Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllll 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCC^C^GGATA 120 

IIIIIMIIMMIIIIIIMIIIMIII II IIIIIIMIIIIIMMIII MINI 

Db 371 AGGGGCTTTTATTTCAA(^GCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGG CATCGTGGATGAGTG CTGCTT CCGGAG CTGTGAT CTGAGGAGG CTGGAGATGTAC 18 0 

IIIIIIIIIIIIIIIIIMIIIMIIMIIIIIIIMIIM 

Db 431 ACAGGCAT CGTGGATGAGTGCTG CTTC CGGAG CTGTGATCTAAGGAGGCTGGAGATGTAT 4 90 

Qy 181 TGTGCACCCCT(^GCCGGCAAAGGCAGCCCGCTCCGTCCGTGCC(^GCGCCACACCGAC 24 0 

II II III MM Mill IIIIIIMIIMIIIIIMIIIII 

Db 4 91 TGCGC^CCCCTC^GCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACAAG 3 00 

IMMIIMM Ml 

Db 551 ATGCCCAAGACCCAG 565 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

IMMMMMIMMIMM llllll MMMMMM II I Ml I MMM 

Db 566 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 618 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

1 1 1 1 1 1 1 1 - llllllllllllll MM INN HI llllll 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGC CACCGCAGGATC CTTTG 678 

Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

I II I II 1 1 lllllllll Mill llllll Ml MMMMIMMIMM MM 

Db 679 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738 

Qy 47 9 TTCAAAGAT-GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

M llllll III I II II I II II I II II II I II I I I I I M I II II 

Db 73 9 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 



RESULT 10 
US-10-136-639-4 

Sequence 4, Application US/10136639 
Publication No. US20030072761A1 
GENERAL INFORMATION: 
APPLICANT : LeBowitz, Jonathan 

TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TARGETING PROTEINS ACROSS 
THE BLOOD BRAIN 

TITLE OF INVENTION: BARRIER 
FILE REFERENCE: SYM-008 

CURRENT APPLICATION NUMBER: US/10/136 , 63 9 
CURRENT FILING DATE: 2002-09-06 
PRIOR APPLICATION NUMBER: US 60/329,650 
PRIOR FILING DATE: 2001-10-16 
NUMBER OF SEQ ID NOS : 4 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 4 
LENGTH: 726 0 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-136-639-4 

Query Match 63.9%; Score 334.4; DB 15; Length 7260; 

Best Local Similarity 84.6%; Pred. No. 9e-99; 

Matches 445; Conservative 0; Mismatches 26; Indels 55; Gaps 4; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIIIIIIIIIIIIIIMII IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMI 

Db 311 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 370 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC^ 120 

llllllllll illlllMIIIMI Mill M IIMIIIIIIIIIMIIIII MIMI 

Db 371 AGGGGCTTTTATTTCAAC^GCCCAC^GGGTATGGCTC(^G(^GTCGGAGGGCGCCTCAG 430 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 180 

1 1 1 1 1 M 1 1 1 II 1 1 M 1 1 II 1 1 M 1 1 1 1 1 ! 1 1 1 1 1 1 M 1 1 1 lllllllllllllllll 

Db 431 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTAAGGAGG CTGGAGATGTAT 4 90 

Qy 181 TGTG(^CCCCTC^GCCGGC7yW3G(^^ 24 0 

II MMMMMIMI II III MM Mill ; 1 

Db 4 91 TGCGCACCCCT(^GCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 550 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAACA^ 3 00 

MIMMIIM III 

Db 551 ATGCCCAAGACCCAG 565 

Qy 301 AGAAGGAAAGGAAGTAC^TTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

I II MM 1 1 II I M III I M I i I 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 566 AAGGAAGTACATTTGAAGAACG CAAGTAGAGGGAGTG CAGGAAACAAGAACTA 618 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGG CCACCGCAGGAC C CTTTG 42 0 

IIIIIIIIIIIIIIIIMI MMMIMIMM MM MMIMIMM MIMI 

Db 619 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 678 

Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 4 78 

I III MM 1 1 1 1 1 1 1 1 1 Mill MIMI III Ml II I Mill! II III I MM 



Db 67 9 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 738 



Qy 479 TTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

M iiiiii mi i n i ii i ii mi i ii ii i n mi mi mi 1 1 

Db 73 9 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 784 



RESULT 11 
US-10-207-655-54 

Sequence 54, Application US/10207655 
Publication No. US20030118592A1 
GENERAL INFORMATION: 
APPLICANT: Ledbetter, Jeffrey A. 
APPLICANT: Hayden-Ledbetter , Martha S. 

TITLE OF INVENTION: BINDING DOMAIN- IMMUNOGLOBULIN FUSION PROTEINS 
FILE REFERENCE: 390069. 401C1 
CURRENT APPLICATION NUMBER: US/10/207 , 655 
CURRENT FILING DATE: 2002-07-25 
NUMBER OF SEQ ID NOS : 426 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 54 
LENGTH: 725 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-207-655-54 

Query Match 63.6%; Score 332.8; DB. 15; Length 725; 

Best Local Similarity . 84.4%; Pred. No. 9.2e-99; 

Matches 444; Conservative 0; Mismatches 27; Indels 55; Gaps 4; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

M Mill Mill MM MM MM MMI MM II MM MINI 

Db 156 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 215 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCT 120 

MMI MM MM MMI M 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 ! II 1 1 IIIIII 

Db 216 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 275 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Mill MMIM MMMIIMI IMMMMMM 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

Db 276 ACAGGTATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 335 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCA.GCGC(^CACCGAC 24 0 

II II I Ml I II III II II Ml Ml 1 1 1 1 1 IIIIIIIMIIIIIIIIIMIIII 

Db 33 6 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 3 95 

Qy 241 ATGCCCAAGACTCAGAAGTATCAGCCTCCATCTACCAA^ 300 

Db 3 96 ATGCCCAAGACCCAG 410 

Qy 3 01 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 3 60 

IMIIIIIIIIIIIIIIMII III II I MM I MM 1 1 II III 1 1 Ml II 1 1 

Db 411 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 463 

Qy 361 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMIIIIIIMI II M II 1 1 II I II M I IIIIII 

Db 464 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATGCCACCGCAGGATCCTTTG 523 



Qy 421 CTCTGCAC - AGTTACCTG - TAAACATTGGAATACCGGCCAAAAAATAAGTTTGATCACAT 478 

Illlllll lllllllll Mill llllll III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I MM 

Db 524 CTCTGCACGAGTTACCTGTTAAACTTTGGAACACCTACCAAAAAATAAGTTTGATAACAT 583 

Qy 47 9 TTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

II llllll III MMMMMMMMMMMMIMIMM 

Db 584 TTAAAAGATGGGCGTTTCCCCCAATGAAATACACAAGTAAACATTC 62 9 



RESULT 12 
US-10-251-661-7 

Sequence 7, Application US/10251661 
Publication No. US20030166555A1 
GENERAL INFORMATION: 
APPLICANT: Alberini, Crist ina M . 
APPLICANT: Bear, Mark F . 

TITLE OF INVENTION: Methods and Compositions for Regulating 
TITLE OF INVENTION: Memory Consolidation 
FILE REFERENCE: 34 99.1001-003 
CURRENT APPLICATION NUMBER: US/10/251,661 
CURRENT FILING DATE: 2002-09-20 
PRIOR APPLICATION NUMBER: 60/193,614 
PRIOR FILING DATE: 2000-03-31 
PRIOR APPLICATION NUMBER: PCT/US0 1/ 1 066 1 
PRIOR FILING DATE: 2001-04-02 
. NUMBER OF SEQ ID NOS : 12 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 7 
LENGTH: 612 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME /KEY: CDS 
LOCATION: (151) . . . (564) 
US-10-251-661-7 

Query Match 52.3%; Score 273.6; DB 13; Length 612; 

Best Local Similarity 83.0%; Pred. No. 2.6e-79; 

Matches 347; Conservative 0; Mismatches 19; Indels 52; Gaps 1 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

M 1 1 1 1 M 1 1 1 1 1 1 1 f 1 1 1 ) MMMMIIMMIIMMMMMMIMIIMIMI 

Db 247 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 306 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGT 12 0 

U M M II 1 1 1 1 1 1 IN I ill 1 1 1 II I II 1 1 ! 1 1 1 1 1 ! 1 1 i 1 1 1 1 MUM 

Db 3 07 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATG 366 

Qy 121 ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

MMIMMMMMMMMMMMMIIMIMIMM M 1 1 1 1 1 1 1 M I M I II 

Db 3 67 ACAGG CATCGTGGATGAGTG CTG CTTCCGGAGCTGTGATCTAAGGAGGCTGGAGATGTAT 426 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

II llllllllllllll II III MM Mill Ml II 1 1 1 1 1 1 1 1 III II I II IN 

Db 427 TGCGCACCCCTCAAGCCTGCCAAGTCAGCTCGCTCTGTCCGTGCCCAGCGCCACACCGAC 4 86 



Qy 241 ATGCCCAAGACTC^GAAGTATC^GCCTC 300 

Illllllllll III 

Db 4 87 ATGCCCAAGACCCAG 501 

Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACA(^GTAGAGGGAGTGCAGGAAAaiAGAACTA 360 

Db 5 02 AAGGAAGTACATTTGAAGAACGCAAGTAGAGGGAGTGCAGGAAACAAGAACTA 554 

Qy 361 CAGGATGTAGGAAGACC CTT CTGAGGAGTGAAGAAGGACAGG CCAC CGCAGGACCCTT 418 

II I Ml II MM lllllll llllllllllllll 1 1 1 1 1 1 1 1 II 1 1 1 1 II MM 

Db 555 CAGGATGTAGGAAGACCCTCCTGAGGAGTGAAGAGTGACATG CCACCG CAGGAT CCTT 612 



RESULT 13 
US-09-852-261-11 

; Sequence 11, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI , GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/ 09/ 852 , 261 
; CURRENT FILING DATE: 2001-05-10 
; PRIOR APPLICATION NUMBER: GB 0011278.9 
; PRIOR FILING DATE: 2000-05-10 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 11 
LENGTH: 487 
TYPE: DNA 

ORGANISM: RattUS sp . 
US-09-852-261-11 

Query Match 50.1%; Score 262; DB 9; Length 487; 

Best Local Similarity 74.7%; Pred. No. 1.5e-75; 

Matches 396; Conservative 0; Mismatches 75; Indels 59; Gaps 3; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Mill Mill M Mill II Mill I Mill I I MUM II I MM II Mill 

Db 1 GGACCAGAGACCCTTTGCGGGGCTGAGCTGGTGGACGCTCTTCAGTTCGTGTGTGGACCA 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCC^ 120 

II II I II I II I II II II I II II II M I II IIMMIMI IMIIIMIIM III 

Db 61 AGGGGCTTTTACTTO^CAAGCCCAC^GTCTATGGCT 120 

Qy 121 ACAGGCATCGTGGATGAGTG CTG CTTC CGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

II Mill Ml II I Mill IMIIIMMIIMIIIIIIIIIIIMIIIMIIIIIII 

Db 121 ACGGGCATTGTGGATGAGTGTTGCTTCCGGAGCTGTGATCTGAGGAGGCTGGAGATGTAC 18 0 

Qy 181 TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 24 0 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 

Db 181 TGTGTCCGCTGCAAGCCTACAAAGTCAGCTCGTTCCATCCGGGCCCAGCGCCACACTGAC 24 0 

Qy 241 ATG CCCAAGACTCAGAAGTATCAG CCT CCATCTACCAACAAGAAAATGAAGTCTCAGAGG 300 

IIIIIIIIIIIIIM 

Db 241 ATGCCCAAGACTCAG 255 



Qy 301 AGAAGGAAAGGAAGTACATTTGAAGAACACAAGTAGAGGGAGTGCAGGAAACAAGAACTA 360 

1,11; IIIIIIIIIIIIIMIIIII Mil I Ml I III 

Db 256 AAGGAAGTACACTTGAAGAACACAAGTAGAGGAAGTGCAGGAAACAAGACCTA 308 

Qy 3 61 CAGGATGTAGGAAGACCCTTCTGAGGAGTGAAGAAGGACAGGCCACCGCAGGACCCTTTG 42 0 

III IMIIIII M II I I Mill I II I II I IIIIMI II MUM 

Db 3 09 CAGAATGTAGGAGGAGCCTCCCGAGGAACAGAAAATGCCACGTCACCGCAAGATCCTTTG 368 
Qy 421 CTCTGCACAGTTACCTGTAAACATTGGAATACCGGCCA AAAAATAAGTTTGATC 474 

II I 1 1 1 1 1 1 1 1 1 1 Ml Mil II III MM II 

Db 369 CTGCTTGAGCAACCTGCAAAACATCGGAACACCTGCCAAATATCAATAATGAGT 42 8 

Qy 4 75 ACATTTCAAAGAT - GGCATTTCCCCCAATGAAATACACAAGTAAACATTC 523 

iiiiiii 1 1 1 1 iiiiiiiiM :i 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 ii 1 1 1 

Db 42 9 TCATTTCAGAGATGGGCATTTCCCTCAATGAAATACACAAGTAAACATTC 478 



RESULT 14 
US-09-852-261-9 

; Sequence 9, Application US/09852261 

; Patent No. US20020083477A1 

; GENERAL INFORMATION: 

; APPLICANT: GOLDSPINK, GEOFFREY 

; APPLICANT: TERENGHI , GIORGIO 

; TITLE OF INVENTION: REPAIR OF NERVE DAMAGE 

; FILE REFERENCE: 117-351 

; CURRENT APPLICATION NUMBER: US/09/852 , 261 

; CURRENT FILING DATE: 2001-05-10 

; PRIOR APPLICATION NUMBER: GB 0011278.9 

PRIOR FILING DATE: 2000-05-10 
; NUMBER OF SEQ ID NOS : 14 

SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 9 ' 
LENGTH: 318 
TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-852-261-9 

Query Match 45.4%; Score 237.6; DB 9; Length 318; 

Best Local Similarity 94.6%; Pred. No. 1.3e-67; 

Matches 246; Conservative 0; Mismatches 14; Indels 0; Gaps 0; 
Qy 1 GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

IIIIMIIIIIIMMIIII IIIMIIIIIIIMMIIIIMIMIIMIIIMIIIII 

Db 1 GGACCGGAGACGCTCTGCGGGGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 60 

Qy 61 AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 12 0 

II IMIIIII IIIIIIIMMM II , M 1 1 M M 1 1 1 1 1 1 1 I MUM 

Db 61 AGGGGCTTTTATTTCAACAAGCCCACAGGGTATGGCTCCAGCAGTCGGAGGGCGCCTCAG 120 

Qy 121 ACAGG CATCGTGGATGAGTGCTG CTT CCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 180 

Mh I III I ! I Ml I I I I I'l I III Ml I N 

Db 121 ACAGG CATCGTGGATGAGTGCTG CTTCCGGAGCTGTGATCTAAGGAGG CTGGAGATGTAT 18 0 

Qy 181 TGTGCACCCCT(L^GCCGGCAAAGGCAGCCCGCTCCGTC^ 24 0 

II IMMMIMIMI II Ml MM Mill MMMIIIMMMMMIMM 



Db 181 TGCGCACCCCTC^GCCTGCC^AGTCAGCTCGCTCTGTCCGTGCCCAGCGCGACACCGAC 24 0 



Qy 241 ATGCCCAAGACTCAGAAGTA 260 

IIIIIIIIIII MINI I 

Db 241 ATGCCCAAGACCCAGAAGGA 260 



RESULT 15 
US-10-238-114-1 

; Sequence 1, Application US/10238114 
; Publication No. US20030100073A1 
; GENERAL INFORMATION: 
; APPLICANT: Merial 

; APPLICANT: ANDREONI , Christine Michele 

; TITLE OF INVENTION: IGF-1 AS FELINE VACCINE ADJUVANT, IN PARTICULAR AGAINST 
FELINE RETROVIRUS 

; FILE REFERENCE: 454313-3165-1 

; CURRENT APPLICATION NUMBER: US/lO/238 , 114 

; CURRENT FILING DATE: 2002-09-10 

; PRIOR APPLICATION NUMBER: FR 01 11736 

; PRIOR FILING DATE: 2001-09-11 

; PRIOR APPLICATION NUMBER: US 60/318 , 666 

PRIOR FILING DATE: 2001-09-12 
; NUMBER OF SEQ ID NOS : 2 0 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 1 

LENGTH: 4 62 
TYPE: DNA 

ORGANISM: Felis catus 
US-10-238-114-1 



Query Match 43.6%; Score 228; DB 15; Length 462; 

Best Local Similarity 92.3%; Pred. No. 2.3e-64; 

Matches 240; Conservative 0; Mismatches 20; Indels 0; Gaps 0; 



Qy 


l 


GGACCGGAGACGCTCTGCGGTGCTGAGCTGGTGGATGCTCTTCAGTTCGTGTGTGGAGAC 

Mill IIIIIIIIIII II llllll IIIIIII M I II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

GGACCAGAGACGCTCTGTGGGGCTGAGTTGGTGGACGCTCTTCAGTTCGTGTGTGGAGAC 


60 


Db 


145 


204 


Qy 


61 


AGGGGCTTTTATTTCAACAAGCCCACAGGATACGGCTCCAGCAGTCGGAGGGCACCTCAG 


120 


Db 


205 


Mill IIIIIII II II MMMIMMMMMMIMIMM 

AGGGGTTTTTATTTCAACAAGCCCACGGGGTATGGCTCCAGCAGTCGGAGGGCACCTCAG 


264 


Qy 


.121 


ACAGGCATCGTGGATGAGTG CTG CTTCCGGAG CTGTGATCTGAGGAGG CTGGAGATGTAC 


180 


Db 


265 


IMMMMMIIMMMIMIMMMMMMIMMIMM MM MM 

ACAGGCATCGTGGATGAGTGCTGCTTCCGGAGCTGTGATCTGAGGCGGCTAGAGATGTAC 


324 


Qy 


181 


TGTGCACCCCTCAAGCCGGCAAAGGCAGCCCGCTCCGTCCGTGCCCAGCGCCACACCGAC 


240 


Db 


325 


II IIIIIIIIIII III 1 1 Ml 1 II Mil II llllllll IIIIIIIIIII III 

TGTGCACCCCTCAAGCCTGCCAAGTCTGCCCGCTCAGTCCGTGCTCAGCGCCACACTGAC 


384 


Qy 


241 


ATGCCCAAGACTCAGAAGTA 260 




Db 


385 


Illllllll llllllll 1 
ATGCCCAAGGCTCAGAAGGA 4 04 
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